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OR LIGAND BINDING DOMAIN 
(57) Abstract 

me present invention relates to novel fusion proteins which activate , ^^^^^JX^ £SS 
their use in the genetic engineering of. f •^"^"IITSSS^ contein itte form muldmers ("bundles") 
which being a bundling domain. Bundling domains .nclude ^ d °^ tmOtat domain. Examples of bundling domatns 

! through motein-protein interactions w.th each other "fj** P££™ £ n £ tetramWi^on domain, the P 53 tetramerizauon 
' mat can be used in the practice of this invention include ^ ob ^ rvible bundling activity. Cells arc engineered by *e 

domain, a leucine zipper «»0™»v^. doma ^.^ n ^fS^r« m Some^es with additional nucleic acid constructs, to render 

r^s^^^^?^ Adm,nisnatio0 of me ugand 10 "* °* * en rcgu 

(positively, or in some case*, negatively) target gene transcription. 
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OR LI G AND BINDING DOMAIN 

BaCk9r i:" ™n7ct«on - a eukaryotic gene involves me — n of a vanety 
1 o, prote^t^rm a -rnpJthat is recced to the gene through M«» 
Key protein domains on one or more of the components include transection 
dorrains and DNA bind.ng domains. EM the mechanism of transcnpt.cn. .denting 
and characterizing components of the transcriptional machinery and ,n some cases 
5 ^ess nfs-eof thoTe components have been the subject of extensive research 
See. e.g. Brent and Ptashne. 1 985; Hope and Struh,. 1986; Keegan et a 1986 Raids 
and Song. 1989; Spencer et al. 1993. Be.shaw et a.. 1996 and R.vera et a.. 1996)(A 
Bibliography is provided just prior to the Examples, below.) 

inscription activation doma.ns are thought to function by reciting a numbe of 
,o proteins with specific functions to the promoter (tin and Green. 1991 ; Goodnch et al. 1993. 
Orphanides et al. 1996 and references cited therein; Ptashne and Gann. 1997 ^and 
references cited therein). Among the large number of activation doma,ns that have been 
characterized to date, the acidic-activation domain of the Herpes S.mplex v.rus encoded 
Proton VP1 6 is considered to be a very strong inducer of transcription and is w,de.y used 
2S E££ researcri (Sadowski et a,. 1988. Ptashne and Gann. 1997 * ™— J ^ 
activation domain of the P 65 subunit of the human transenption factor 
potent stimulator of gene expression, and in certain contexts can tnduce transenption mere 
strongly than VP16 (Schmitz and Baeuerie. 1991; Ba..ard et al. 1992; Moore at al. 1993. 
Z efal. 1994; Nalesan et a.. 1997). Both the VP16 and P 65 ~*™^e< 
30 thought to function by interacting with and recruiting a number of p£e,ns to the promoter 
(Cress and Triezenberg. 1990; Scmitz at al. 1994; Uesugi et al. 1997). 

One of the remarkable features of such activation domains is that 1us.ng ttem to 
heterologous protein domains seldom affects their ability to activate transenption .when 
• recruited to a wide variety of promoters, T*e high degree of functional 
35 exhibited by these activation doma,ns makes them valuable tools ,n vanous b^g ca 

assays for ana.yz.ng gene expression and protein-protein ^^^^^ 
molecule drug interactions (F.elds and Song. 1989; Senguptha et al, 1996. Ftivera et al. 
t996- Triezenberg. 1995 and references cited therein). The ability to activate gene 
l! P fesZ suong^ and when recruited to a wide range of promoters makes ^h p65 and 
40 VP1 6 attractive candidates tor activation of gene transcription in ^^Tli* 
applications. However, even more potent activation doma.ns. ,f avertable, would be 
for achieving higher levels of transcription on a per cel. basis, and for .mprovng the 
H£Z Z ^the'many biological assays that refy upon activation of transenption of a 
reporter gene. 
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Several strategies to improve the potency of activation doma.ns and thereby the 
expression of genes under their control have been reported (Emam. and Carey 1992. 
GerTL 1 994;Ohashieta..l994;Blairata.. 1996; Tanakaeta., 1996) These 
apples g enera.,y involve increasing the number of copies of activate doma.ns fused 
to The DNA binding domain or generating activators containing synergizinc , comb,nat,ons of 
acli^on domains. AUhough some abators generated by these methods have been 
shown to be more potent, a number of limitations predude their wdespread application. 
F^st potent activators comprising reiterated activation domains do not .crease he 
absolute levels of reporter gene expression when tested on promoters W .lh mulfple 
biding sL for the activator (Emami and Carey. 1992). Second, a number of synerg.st.c 
combinations of activation domains reported in the literature involve weak act.vat.on 
domains and the absolute levels of gene expression induced by these synergmng _ 
activation domains are mucb lower compared to potent addic adiv *«™*™™<™J™ e 
or P 65 (Gerber at al. 1994; Tanaka et al. 1996). Third, it is not known whether any o these 
potent adivation domains are capable of inducing gene transcription strongly when they 
are non-covalently linked to the DNA binding domain. Fourth, many potent ad.vators 
containing multiple copies of VP16 or other acidic activators are h.ghly toxic and/or 
accumulate to only low levels in the cell. 

As mentioned at the outset, a variety of important applications .nvoMng gene 
transcription require or would benefit from higher levels of gene express-on. As noted 
above, however, efforts to improve the potency of activation domains have been 
disappointing. Moreover, expression of various transcription activators revealed that 
obseL levels of more potent activators, such as the P 65 unit of NF-kB 
expeded. Without wishing to be bound by any one theory, we suggest that the more 
potent the activation domain, the more toxic it is to the cel.. the ,S * 

expression and/or the less of it is observed to accumulate in cells. How. then, .s .t 
possible to increase levels of heterologous gene expression? Remarkably, we have 
found that it is still possible to outmaneuver these fads of nature to improve heterologous 
gene expression and have in fad done so using the prindples of -bundUng". *ne 
engineering of the transcription activation domain, and combinations thereof, as descnbed 



below. 



35 



40 



Summary of the Invention . ,. . 

This document discloses new improvements in the des.gn and del.very of 
transcription adivation domains and provides improved materials and methods ifor 
regulating the transcription of a target gene. Aspeds of the invention are applicable to 
systems involving either cova.ent or non-covalent linking of the transcription activation 
domain to a DNA binding domain. 

Key features of the invention indude "bundling- domains, fusion proteins ">"ta,nmg 
them, recombinant nucleic acids encoding such fusion proteins, systems involvng bundles 
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oi such fusion proteins, and other materials and methods involving such bundling domains. 

Key tsL; J proLs of the invent contain at .east two mutually heterologous domains. 

one of which being a bundling domain. An important design concept is that the fusion 

p"te°nTdo not need to act alone, .nstead. they find and bind to each other (or w,th other 
5 prote,ns containing the bundling domain) to form a posse to accomplish their mission. In 
5 cells are engineered by the introduction of recombinant nucleic acds encod ng the 

fusion proteins, and in some cases with additionaf nucleic acid 

capable of ligand-dependent regulation of transcription of a target gene. Administration of 
, he ligand to the cells then regulates (positively, or in some cases, negatively) target gene 

1 o transcription. . 

Detailed information concerning bundling domains, guidance on the.r use and 
mutative examples are provided below. Generally speaking. 
any domain that induces proteins that contain it to form multimers ("bundles >J£«*» 
protein-protein interactions with each other or with other prote.ns 
1S domain Examples of bundling domains that can be used in the practice of th.s .nvention 
; domains such as the lac repressor tetramerization domain, the p53 
domain, a .eucine zipper domain, and domains derived therefrom ^ 
bundling activity. Proteins containing a bundling domain are capable of comptexmg wrth 
one another to form a bundle of the individual protein molecules. Such bundling .s 
-constitutive" in the sense that it does not require the presence of a cross-,^ ^gent 
(i.e.. a cross-linking agent which doesn't itself contain a proteinaceous bundling domain) to 

link the protein molecules. ;„~i,,riort 
mustrative (non-limiting) examples of heterologous domains which can be included 
along with a bundling domain in various fusion proteins of this invention include 
transcription regulatory domains (i.e.. transcription activation domains such as a p65. VP16 

domains such as an ssn-6/RJP-l domain or Kruppe. family suppressor domain); a OHA 
binding domain such as a GAL4. lex A or a composite DNA binding domain such as a 
composite zinc finger domain or a ZFHD1 domain; or a ligand-binding domain posing or 
30 derived from (a) an immunophilin. cyc.ophi.in or FRB domain; (b) an antibiotic binding 
domain such as tetR: or (c) a hormone receptor such as a progesterone receptor or 

^TI variety of ligand binding domains may be used in this invention, although 
.igand binding domains which bind to a cel. permeant ligand are preferred. It * also 
3S preferred that the ligand have a molecular weight under about 5kD. mo* 

2 5 kD and optimally below about 1500 D. Non-proteinaceous Ugands are also preferred. 
Uqand binding domains include, for example, domains selected or derived from (a) an 
tSSSlW FKBP 12). cydophi.in or FRAP domain; (b) a hormone recepto^ as 
a recepTor for progesterone, ecdysone or another steroid; and (c) an antibiotic receptor 
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such as a tetR domain for binding to tetracycline, doxycycline or other analogs or mimics 
thereof 

Examples of ligand binding domain/ligand pairs that may be used in the practice , of 
this invention include, but are not limited to: FKBP:FK1012. FKBP:synthetic divalent FKBP 
Iigands (see WO 96/0609 and WO 97/31898). FRB:rapamycin/FKBP (see e.g.. WO 
96/41865 and Rivera et al. "A humanized system for pharmacologic control of gene 
expression". Nature Medicine 2(9):1028-1032 (1997)). cyclophilinrcyclosponn (see e.g 
WO 94/18317). DHFR:methotrexate (see e.g. Ucitra etal. 1996. Proc. Natl. Acad. Sc.. USA 
93-12817-12821) TetR:tetracycline or doxycycline or other analogs or mimics thereof 
(Gossen and Bujard. 1992. Proc. Natl. Acad. Sci. U.S.A. 89:5547; Gossen et al. 1995. 
Science 268:1766-1769; Kistner et al. 1996. Proc. Natl. Acad. Sci. USA 93:10933-10938). a 
progesterone receptor:RU4S6 (Wang et al. 1994. Proc. Natl. Acad. Sc. USA 
91 -81 80-81 84). ecdysone receptor.ecdysone or muristerone A or other analogs or mim.cs 
thereof (No et al. 1 996, Proc. Natl. Acad.. Sci. USA 93:3346-3351) and DNA 
gyrase:coumermycin (see e.g. Farrar et al, 1996, Nature 383:178-181). 

A wide variety of DNA binding domains may be used in the practice of this 
invention, including a domain selected or derived from a GAL4. lexA or composite (e.g. 
2FHD1) DNA binding domain, or a DNA binding domain, e.g.. in combination wrth ligand 
binding domains such as a wt or mutated progesterone receptor domain. TetR domains 
which provide both DNA binding and ligand binding functions, are discussed in the context 
of ligand binding domains. In many applications it is preferable to use a DNA binding 
domain which is heterologous to the cells to be engineered. Heterologous DNA bmdmg 
domains include those which occur naturally in cell types other than the cells to be 
engineered as well as composite DNA binding domains containing component portions 
which are not 1ound in the same continuous polypeptide or gene in nature, at least not <n 
the same order or orientation or with the same spacing present in the composite domain. In 
the case of composite DNA binding domains, component peptide portions wh.ch are 
endogenous to the cells or organism to be engineered are generally preferred. 

In the case of the chimeric transcription factors containing a tetR domain, the DNA 
binding domain is provided by the tetR component, and is by its nature heterologous to 
eukaryotic cells. TetR domains are discussed in further detail in the context of ligand btnd.ng 
domains. 

in embodiments in which an endogenous gene is to be regulatably expressed, a 
composite DNA binding domain which is selected for recognition of one or more sequences 
upstream of the target gene may be deployed. 

Additional information concerning DNA binding domains is provided below. 
In an important application of this invention, two or more of the fusion proteins in the 
bundle each comprise, in addition to the bundling domain, at least one transection 
activation domain which is heterologous to the bundling domain. Bundling of protons 
containing transcription activation domains can significantly increase their effective potency 
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(relative to a single such fusion protein lacking a bundling domain) and consequently leads 
JSrong inducfon of gene expression. Unlike their counterparts .ac*,ng a bund ng doma.n. 
fusion proteins containing a bundling domain are designed to achieve effective local 
concentrations of transcription activation domains and to robustly induce gene expression 
; when recruited en masse to an expression control sequence^ven despite relatively Mow 
overall levels of expression or accumulation of the fusion proteins. Highly potent bundled 
activation domains can also be used in a wide variety of assays having transections, 
read outs Such assays indude assays for identifying protein-protein interacts (or 
inhibitors thereof) in a eukaryotic, preferably mammalian, twe-hybrid assay or vanant 
o thereof, e.g.. three-hybrid assay, reverse two-hybrid assay, etc. 

Bundling domains may be introduced into the design of fusion proteins of a vanety 
of regulated gene expression systems, including both allostery-based systems such as 
those regulated by tetracycline. RU486 or eodysone. or anatogs or mimics 
dimerization-based systems such as those regulated by divalent compounds l.ke FK1012 
l5 FKCsA rapamycin. AP1510 or coumermycin. or analogs or mimics thereof, all as desenbed 
below (See also. Clackson. 1997. Controlling mammalian gene expression with small 
mo/ecules.Current Opinion in Chem. Biol. 1:210-218). The fusion proteins may compnse 
any combination of relevant components, including bundling domains. DNA b.nd.ng 
domains, transcription activation (or repression) domains and ligand bindmg domatns. 
20 Other heterologous domains may also be included. 

Various embodiments of this invention involve fusion proteins wh.ch contam at 
least one bundling domain. DNA binding domain and transcription activation doma.n; at 
least one bundling domain, ligand binding domain and transcription repression doma.n; at 
least one bundling domain, ligand bind.ng domain and DNA binding doma,n; at least one 
25 bundling domain, ligand binding domain. DNA binding domain and transcription actrvaton 
domain; and. preferably, at least one bundling domain, ligand binding domarn and 
transcription activation domain. In currently preferred embodiments, these fusion , prote.ns 
represent improvements on the type described in W094/18317 and W096/41865. 
wherein the ligand binding domain is or is derived from a cyclophilin. immunoph.hn (e.g^an 
30 FKBP domain) or FRB domairv-elthough. any ligand binding domain may be used ,n the 
chimeric proteins, and the regulatory mechanism can be dimerization- or altostery-based. 

A preferred fusion protein contains a lac repressor tetramenzaton domam. an FRB 
domain and a transcription activation domain derived from the activation domain of human 
P 65 It should be appreciated that in any of the embedments of this invent, on mvolvng a 
35 fusion protein containing at least one transcription activation domain derived from p65 

whether with or without a bundling domain, the p65 peptide sequence may be a naturally 
occurring P 65 sequence or may be engineered as described below. 

Another aspect of this invention involves improvements in the transcnption 
activation domain itself. In this regard, recombinant nucleic acids are P^^^^ 
40 fusion proteins containing a transcription activation domain and at least one add.t.ona« 
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domain that is heterologous thereto, where the peptide sequence of the act.vat.on domain 
is itself modified relative to the naturally occurring sequence from which it was derived to 
increase or decrease its potency as a transcriptional activator relative to the counterpart 
comprising the native peptide sequence. Certain embodiments of this invention .nvolve 
fusion proteins containing a transcription activation domain derived from p65 and beanng 
one or more of the mutations shown in F.gure 7. Fusion proteins containing one or more 
modified activation domains can also contain a bundling domain to further increase the.r 
efficacy as transcriptional activators, and/or one or more additional doma.ns such as a 
ligand binding domain, DNA binding domain or transcription activation synergizing doma.n. 
such as are noted above and as discussed below. 

The invention thus provides recombinant nucleic acid constructs which encode the 
various proteins of this invention or are otherwise useful for practicing it. various DNA 
vectors containing those constructs for use in transducing prokaryotic and eukaryotjc cells, 
cells transduced with the recombinant nucleic acids, fusion proteins encoded by the above 
recombinant nucleic acids, and target gene constructs. 

Also provided are nucleic acid compositions comprising two or more recombinant 
nucleic acids which, when present within a cell, permit transcription of a target gene, 
preferably following exposure to a cell permeant ligand. These compositions are illustrated 

as follows: . . . 

Composition #1. A first such composition comprises a recombinant nucleic aao 

encoding a fusion protein comprising at least one ligand binding domain, bundling domain 

and transcription activation domain; a second recombinant nucleic acid encoding a fusion 

protein comprising a DNA binding domain and at least one ligand binding doma.n; and an 

optional third recombinant nucleic acid comprising a target gene (or cloning site) operatjvely 

linked to an expression control sequence including a DNA sequence recognized by the 

DNA binding domain mentioned above. Such compositions are illustrated by embodiment 

in which the ligand bindino domains are or are derived from immunophilin. cydophihn or FRB 

domains; the transcription activation domain is or is derived from an activation doma.n such 

as a VP1 6 or p65 domain; and the bundling domain is or is derived from a lac repressor 

tetramenzation domain. 

Composition #2. Another such composition is similar to Composition #1 except 
that the fusion protein encoded by the first recombinant nucleic acid compnses at least one 
ligand binding domain, bundling domain and DNA binding domain, and the fusion protein 
encoded by the second recombinant nudeic add comprises a transcnption act.vat.on 
domain and at least one ligand binding domain. 

Composition #3. Another such composition compnses a recombinant nucleic add 
encoding a fusion protein comprising at least one ligand binding domain, bundling domain 
and transcription activation domain; a second recombinant nucleic acid encoding a protein 
comprising a DNA binding domain; and an optional third recombinant nudeic add compns.ng 
> a target gene (or doning site) operatively linked to an expression control sequence 
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including a DN A sequence recognized by the DN A binding domain mentioned above. Such 
cLposUions are illustrated by embodiments in which the Hgand bindmg doma.ns are or ar 
derived from a receptor domain such as an ecdysone receptor the DNA binding domar us 
or is derived from a DNA binding domain such as an RXR protein, chosen for its abilrty to 
bind to the receptor domain in the presence of a ligand f cr that receptor; the transition 
activation domain is or is derived from an activation domain such as a VP16 or p6o domain; 
and the bundling domain is or is derived from a lac repressor tetramerization doma.n. 

Composition #4. Another such composition comprises a recombinant nucle.c acid 
encoding a fusion protein comprising at least one ligand binding domain. DNA binding 
domain, bundling domain and transcription activation domain (where the ligand b.nd.ng 
domain and DNA binding domain may be pan of or derived from the same domain); and an 
optional second recombinant nucleic acid comprising a target gene (or clon.ng site) 
operatively linked to an expression control sequence including a DNA sequence 
recognized by the DNA binding domain mentioned above. Such oompos.t.ons are 
illustrated by embodiments in which the ligand binding and DNA binding domains are or are 
derived from a receptor domain such as a tetracycline receptor which is capable of b.nd.ng 
to a characteristic DNA sequence in the presence of tetracycline or another hgand for the 
receptor- the transcription activation domain is or is derived from an activation domain such 
as a VP16 or p65 domain; and the bundling domain is or is derived from a lac ^pressor 
, tetramerization domain. Such compositions are further illustrated by embodiments ,n wh.ch 
the ligand binding domain is or is derived from a receptor domain such as a ip«g«ww 
receptor which is capable of binding to progesterone or analogs or m.m.cs thereof, .nc.ud.ng 
RU486; the DNA binding domain is or is derived from a GAL4 or composite DNA binding 
domain- the transcription activation domain is or is derived from an activation domain such 
5 as a VP16 or p65 domain; and the bundling domain is or is derived from a lac repressor 
tetramerization domain. . 

Composition #5. Another such composition, which unlike Compositions 1 - 4 is 
designed for constitutive expression rather than for ligand-mediated regulation of 
transcription, comprises a recombinant nucleic acid encoding a fusion protein oompns.ng at 
so least one DNA binding domain, bundling domain and transcription activation domain; and a 
second recombinant nucleic acid comprising a target gene (or cloning site) <*e«™*r 
linked to an expression control sequence including a DNA sequence recognized by the 
DNA binding domain mentioned above. Such compositions are illustrated by embodiments 
in which the transcription activation domain is or is derived from an activate doma.n such 
35 as a VP1 6 or p65 domain; the DNA binding domain is or is derived from a GAL4 or 
composite DNA binding domain; and the bundling domain is or is denved from a lac 
repressor tetramerization domain. „. „j„,„ 

Compositions 1 . 3. 4 and 5 may further comprise an additional recombinant nucleic 
acid encoding a fusion protein comprising a bundling domain and at least one transition 
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activation domain or transcription synergizing domain, with or without one or more optional 
additional domains. 

Each of the recombinant nucleic acids of this invention may further comprise an 
expression control sequence operably linked to the coding sequence and may be provided 
within a DNA vector, e.g.. tor use in transducing prokaryotic or eukaryotic cells. Some or all 
of the recombinant nucleic acids of a given composition as described above, including any 
optional recombinant nucleic acids, may be present within a single vector or may be 
apportioned between two or more vectors. In certain embodiments, the vector or vectors 
are viral vectors useful for producing recombinant viruses containing one or more of the 
recombinant nucleic acids. The recombinant nucleic acids may be provided as inserts 
within one or more recombinant viruses which may be used, for example, to transduce cells 
in vitro or cells present within an organism, including a human or non-human mammalian 
subject For example, the recombinant nucleic acids of any of Compositions 1 - 5. including 
any optional recombinant nucleic acids, may be present within a single recombinant virus 
or within a set of recombinant viruses, each of which containing one or more of the set of 
recombinant nucleic acids. Viruses useful for such embodiments indude any virus useful for 
gene transfer, including adenoviruses, adeno-associated viruses (AAV), retroviruses, 
hybrid adenovirus-AAV. herpes viruses, lenti viruses, etc. In specific embodtments. the 
recombinant nucleic acid comprising the target gene is present in a f irst virus and one or 
more or the recombinant nucleic acids encoding the transcription regulatory prote.n(s) are 
present in one or more additional viruses. In such multiviral embodiments, a recombinant 
nucleic acid encoding a fusion protein comprising a bundling domain and a transection 
activation domain, and optionally, a ligand binding domain, may be provided in the same 
recombinant virus as the target gene construct, or alternatively, on a third v.rus. It should 
be appreciated that non-viral approaches (naked DNA. liposomes or other lipid 
compositions, etc.) may be used to deliver recombinant nucleic acids of this invention to 

cells in a recipient organism. 

The invention also provides methods for rendering a cell capable of regulated 
expression of a target gene which involves introducing into the cell one or more of the 
recombinant nucleic acids of this invention to yield engineered cells which can express the 
appropriate fusion protein(s) of this invention to regulate transcription of a target gene. The 
recombinant nucleic acid(s) may be introduced in viral or other form into cells ma.ntamed in 
vitro or into cells present within an organism. The resultant engineered cells and their 
progeny containing one or more of these recombinant nucleic acids or nucleic acid 
compositions of this invention may be used in a variety of important applications 
discussed elsewhere, including human gene therapy, analogous vetennary applications, 
the creation of cellular or animal models (including transgenic applications) and assay, 
applications. Such cells are useful, for example, in methods involving the add.tion of a 
ligand. preferably a cell permeant ligand. to the cells (or administration of the ligand to an 
organism containing the cells) to regulate expression of a target gene. Particulariy important 
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animal models include rodent (especially mouse and rat) and non-human pnmate models, n 
qene therapy applications, the cells will generally be human and the peptide sequence of 
each of the various domains present in the fusion proteins (with the possible exception of 
the bundling domain) will preferably be. or be derived from, a peptide sequence of human 

5 ° n9 ' n ' in certain assay applications, recombinant nucleic acids are designed as described 
for Composition #1 . except that the ligand binding domains of the fusion proteins are 
replaced with protein domains that are known to bind to each other. Cells transduced wrth 
these recombinant nucleic acids and with a matched target gene construct express a target 
10 gene typically selected for convenience of measurement of expression level. These cells 
can be used to identify the presence of a substance which blocks the interaction of the two 
protein domains which are known to interact. 

In other 2-hybrid-type applications aimed at the identification of genes encoding 
proteins which interact with a protein or protein domain of interest, cells are transduced wrth 
« similar recombinant nucleic acids as described immediately above, except that a l.brary of 
test nucleic acid sequences of potential interest is cloned into one of the recombinant 
nucleic acids encoding one of the fusion proteins. A 2-hybrid style assay is conducted .n 
which transcription of the target gene indicates the presence of a test nucleic acid sequence 
which encodes a domain that interacts with the protein domain in the cognate fus»on 
20 protein. 

Reverse 2-hybrid-type assays may be conducted analogously using cells 
engineered to positively or negatively regulate expression of a reporter gene as a result of 
-2-hybricT formation. The cells are exposed to one or more test substances, and inhibition 
of regulation of expression is taken as an indication of possible inhibition of the 2-hybr,d 
25 lormation. 

Brief Description of the Figures 

Abbreviations used in the Figures: 
30 G = yeast GAL4 DNA binding domain, amino acids 1 -94 
F s human FKBP12. amino acids 1-107 
R = FRB domain of human FRAP, amino acids 2025-2113 

S = activation domain from the p65 subunit of human NF-kB, amino acids 361-550 
V = activation domain from Heipesvirus VP16. amino acids 410-494 
35 L= E coii lactose repressor, amino acids 46-360 

MT = Minimal Tetramerization (-bundling-) domain of E. ooi lactose repressor, amino acids 324- 

360 

FIG. 1 Diagram comparing various fusion proteins, with and without bundling domains, and 
40 their use in various strategies for delivery of activation domains to the promoter of a target 
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oene (A) two fusion proteins, one containing a DNA binding domain (e.g. a GAL4 or 
ZFHD1 DNA binding domain) fused to an FKBP12, and the other containing a p65 
activation domain fused to an FRB. are expressed in cells. Addition of rapamycin leads to 
the recruitment of a singe activation domain to ach DNA binding domain monomer. B) 
Fusion of multiple FKBPs to the DNA binding domain allows rapamycin to recruit multiple 
activation domains to each DNA binding domain monomer. (C) Addition of the lactose 
repressor tetramerization domain to the FRB-activation domain fusion allows rapamycm to 
recruit four activation domains to each FKBP fused to the DNA binding domain (D) 
Rapamycin recruits bundled activation domain fusion protein to each of the FKBP-DNA 
S domain fusion prote.ns. (E) and (F) illustrate a mutated tetR-based system, wthout 
and with bundling. (G) and (H) illustrate an engineered progesterone-R-based system, 
without and with bundling. 

FIG 2 Expression levels of the stably integrated reporter gene correlate with the number 
of activation domains recruited to the promoter. The indicated DNA binding domain and 
activation domain fusions were transfected into HT1080B cells containing a stably 
integrated SEAP reporter. Mean values of SEAP activity secreted into the medium following 
addLn of 10 nM rapamycin are shown <+A S.D.). In a«. cases. SEAP express.cn values 
are plotted for cultures receiving 1 00 ng of activation domain expression plasm.d. which 
gives peak expression values in transiently transfected cells and slightly below peak 
levels in the stably transfected cell line. 

FIG 3 Synergy between the activation domains in the RLS bundle is the primary cause 
for the super-activation of the reporter gene expression, a) Schematic illustration of the 
composition of the protein bundles of RLS with increasing concentration of sW^ssed 
LS orL in the cel.. b) Twenty nanograms of GFt encoding p.asm,d was co-transfected w,th 
100 ng of RLS alone or with indicated concentrations of LS or L regions. The cells were 
stimulated with 10 nM rapamycin and the SEAP activity in the medium was measured 1 8 
hrs after tiansfection. Mean values of SEAP activity secreted into the medium following 
, addition of rapamycin are shown (-/- S.D.). c) Western blot analyses us.ng 12CA5 

antibody against hemagglutinin epitope of various recombinant proteins expressed .n the 
transfected cells is shown. 

FIG 4 A thirty-six amino acid region in the carboxy terminal of the lactose repressor 
.5 protein is sufficient for generating highly potent and bundled activation doma,n fus-on 
" proteins HT1 080 B cells were co-transfected with 20 ng GF1 and 1 00 ng of ind.cated 
activation domain containing plasmid vectors. Transcription of the reporter gene : was^ 
stimulated by the addition of 10 nM rapamycin in the medium. Mean values of SEAP 
activity secreted into the medium assayed 24 hrs after transection are shown («■/- S.D.) 



40 



10 



WO 99/10510 



PCT/US98/17723 



FIG 5 Tethering bundled activation domain fusion proteins to DNA binding proteins 
significantly reduces the amount of reconstituted activators required to strongly stimulate 
the target gene expression, a) Twenty nanograms of GF4 and indicated concentrates of 
activation domain expressing plasmids were transfected into HT1080 B cells. Transcnpt.cn 
of the stably inteorated reporter gene was induced by the addition of 10 nM rapamycin in 
the medium, b) Western blot analysis of the relative expression levels of the transfected 
transcription factors, c) Twenty nanograms of GF4 and one hundred nanograms of the 
indicated activation domain fusion protein encoding plasmids were co-transtected into 
HT1080 B cells and the transcriptional activity of the GAL4 responsive reporter gene was 
induced by the addition of indicated concentrations of rapamycin in the medium. In all 
cases, mean values of SEAP activity secreted into the medium 24 hrs after the addition of 
rapamycin are shown (+/- S.D.). 

FIG. 6 Bundling the target-activation domain fusion protein improves the sensitivity of the 
two-hybrid assay in mammalian cells. Diagram showing two-hybrid assay using bundled 
fusion protein containing the target and activation domains. GAL4 DNA binding domain 
fused to c-Cbl (GCBL) is shown interacting with its target protein SH3 fused to either a) 
VP16 activation domain (SH3S) or b) lactose repressor tetramerization domain-VP1 6 
activation domain sequences (SH3MTS). c) HT1080 B cells containing stably integrated 
GAL4 responsive reporter gene were transfected with 100 ng of indicated expression 
plasmids. Mean values of SEAP activity secreted into the medium 24 hrs after transfection 
are shown (+/- S.D.). 

FIG. 7 Mutations for the p65 transcription activation domain are listed, including: 

1 . Mutations that are intended to increase activation potency, including M1 . M2. M6.M7 
andM8. 

2. Mutations that are intended to slightly decrease activation potency, includ.ng M4 and 
M5. 



Detailed Description of the Invention 



Definitions 

For convenience, the intended meaning of certain terms and phrases used herein 

are provided below. 

"Activate" as applied to the expression or transcription of a gene denotes a 
directly or indirectly observable increase in the production of a gene product, e.g.. an RNA 
or polypeptide encoded by the gene. 

"Capable of selectively hybridizing" means that two DNA molecules are 
o susceptible to hybridization with one another, despite the presence of other DNA 
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molecules under hybridization conditions which can be chosen or read.ly determmed 
empirically by the practitioner of ordinary skill in this art Such treatments include cond.t.ons 
of high stringency such as washing extensively with buffers containing 0.2 to 6 x SSC 
and/or containing 0.1% to 1% SDS, at temperatures ranging from room temperature to 65- 
75 o C See for example F.M. Ausubel et aL. Eds, Short Protocols in Molecular B.ology. 
Units 6.3 and 6.4 (John Wiley and Sons. New York. 3d Edition, 1995). 

-Cells" "host cells" or "recombinant host cells" refer not only to the part.cular 
cells under discussion, but also to their progeny or potential progeny. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are st.ll 
included within the scope of the term as used herein. 

"Cell line" refers to a population of cells capable of continuous or prolonged 
growth and division in vitro. Often, cell lines are clonal populations derived from a single 
progenitor cell. It is further known in the art that spontaneous or induced changes can occur 
in karyotype during storage or transfer of such clonal populations. Therefore, cells denved 
from Atari line referred to may not be precisely identical to the ancestral cells or cultures, 
and the cell line referred to includes such variants. 

"Composite", "fusion", and -recombinant" denote a matenal such as a nude.c 
acid nucleic acid sequence or polypeptide which contains at least two constituent porfions 
which are mutually heterologous in the sense that they are not otherwise found d.rectly 
(covalently) linked in nature, i.e., are not found in the same continuous polypept.de or gene 
in nature at least not in the same order or orientation or with the same spacng present .n 
the composite, fusion or recombinant product. Typically, such materials contam 
components derived from at least two different proteins or genes or from at least two non- 
adjacent portions of the same protein or gene. In general, "composite" refers to portions ^of 
different proteins or nucleic acids which are joined together to form a single f unct.onal un.t, 
while "fusion" generally refers to two or more functional units which are linked together. 
"Recombinant" is generally used in the context of nudeic acids or nucleic acd sequences. 

-Cotactor refers to proteins which either enhance or repress transcnption in a 
non-gene specific manner. Cofactors typically lack intrinsic DNA binding specificity, and 
function as general effectors. Positively acting cofactors do not stimulate basal 
transcription, but enhance the response to an activator. Positively acting cofactors -ndude 
PC1 PC2. PC3, PC4, and ACF. TAFs which interact directly with transcriptional act.vators 

are also referred to as cofactors. 

A "coding sequence" or a sequence which -encodes" a particular polypept.de or 
RNA is a nudeic add sequence which is transcribed (in the case of DNA) and translated 
(in the case of mRNA) into a polypeptide in vitro or in vivo when placed under the control 
of an appropriate expression control sequence. The boundaries of the coding sequence 
are generally determined by a start codon at the 5" (amino) terminus and a translation stop 
0 codon at the 3' (carboxy) terminus. A coding sequence can indude. but is not Um.ted to. 
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cONA from procaryotic or eukaryotic mRNA, genomic DNA sequences from procaryotic or 
eukaryotic DNA, and synthetic DNA sequences. A transcription termination sequence w,ll 
usually be located 3' to the coding sequence. 

The term "conjoint", with respect to administration of two or more viruses, refers to 
the simultaneous, sequential or separate dosing of the individual virus provided that some 
overlap occurs in the simultaneous presence of the viruses in one or more cells of the 
animal. 

A "construct", e.g.. a "nucleic acid construct" or "DNA construct", refers to a 

nucleic acid or nucleic acid sequence. 

"Derived from" denotes a peptide or nucleotide sequence selected from within a 
given sequence. A peptide or nucleotide sequence derived from a named sequence may 
further contain a small number of modifications relative to the parent sequence, in most 
cases representing deletion, replacement or insertion of less than about 15%. preferably 
less than about 10%. and in many cases less than about 5%. of amino acid res.dues or 
bases present in the parent sequence. In the case of DNAs. one DNA molecule is also 
considered to be derived from another if the two are capable of selectively hybridizing to 
one another. Polypeptides or polypeptide sequences are also considered to be denved 
from a reference polypeptide or polypeptide sequence if any DNAs encoding the two 
polypeptides or sequences are capable of selectively hybridizing to one another. 
Typically a derived peptide sequence will differ from a parent sequence by the 
replacement of up to 5 amino acids, in many cases up to 3 amino acids, and very often by 
0 or 1 amino acids. A derived nucleic acid sequence will differ from a parent sequence by 
the replacement of up to 15 bases, in many cases up to 9 bases, and very often by 0 - 3 
bases. In some cases the amino acid(s) or base(s) isVare added or deleted rather than 
replaced. 

"Domain" refers to a portion of a protein or polypeptide. In the art. the term 
-domain" may refer to a portion of a protein having a discrete secondary structure. 
However, as will be apparent from the context used herein, the term "domain- as used in 
this document does not necessariy connote a given secondary structure. Rather, a 
peptide sequence is referred to herein as a "domain" simply to denote a polypeptide 
sequence from a defined source, or having or conferring an intended or observed activity. 
Domains can be derived from naturally occurring proteins or may compnse non-naturally- 
occurring sequence. . . 

"DNA recognition sequence" means a DNA sequence wh.ch is capable of 
binding to one or more DNA-binding domains, e.g.. of a transcription factor or an engineered 
polypeptide. 

"Expression control element", or simply "control element", refers to DNA 
sequences, such as initiation signals, enhancers, promoters and silencers, which induce or 
control transcription of DNA sequences with which they are operably linked. Control 
) elements of a genemay be located in introns. exons. coding regions, and 3' flanking 
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sequences. Some control elements are 'tissue specific'. Le.. affect express-on of the 
selected DN A sequence preferentially in specific cells (e.g.. cells of a specfc fssue), wh,le 
others are active in many or most cell types. Gene expression occurs preferentially m a 
specific cell if expression in this cell type is observably higher than express.on .n other cell 
types Control elements include so-called 'leaky- promoters, which regulate expression of 
a selected DN A primarily in one tissue, but cause expression in other tissues as well. 
Furthermore, a control element can act constitutive* or inducibly. An inducible promoter for 
example, is demonstrably more active in response to a stimulus than in the absence of that 
stimulus. A stimulus can comprise a hormone, cytokine, heavy metal, phorbol ester, cychc 
AMP (cAMP) retinoic acid or derivative thereof, etc. A nucleotide sequence conta.n.ng one 
or more expression control elements may be referred to as an "expression control 
sequence". 

-Gene" refers to a nucleic acid molecule or sequence comprising an open read.ng 
frame and including at least one exon and (optionally) one or more intron sequences. 

"Genetically engineered cells" denotes cells which have been modrf.ed by the 
introduction of recombinant or heterologous nucleic acids (e.g. one or more DNA constructs 
or their RNA counterparts) and further includes the progeny of such cells which reta.n part 
or all of such genetic modification. 

"Heterologous", as it relates to nucleic acid or peptide sequences, denotes 
sequences that are not normally joined together, and/or are not normally assocated wrth a 
particular cell. Thus, a 'heterologous' region of a nucleic acid construct is a segment of 
nucleic acid within or attached to another nucleic acid molecule that is not found .n 
association with the other molecule in nature. For example, a heterologous reg.on of a 
construct coufd include a coding sequence flanked by sequences not found in assoc.at.on 
with the coding sequence in nature. Another example of a heterologous ceding sequence 
is a construct where the coding sequence itself is not found in nature (e.g.. synthetic 
sequences having codons different from the native gene). Similarly, in the case of a cell 
transduced with a nucleic acid construct which is not normally present .n the cell, tne cell 
and the construct would be considered mutually heterologous for purposes of th.s 
invention. Allelic variation or naturally occurring mutational events do not g.ve nse to 

heterologous DNA, as used herein. 

"Initiator" refers to a short, weakly conserved element that encompasses tne 
transcription start site and which is important for directing the synthesis of property init.ated 
transcripts. 

"Interact" refers to direct* or indirectly detectable interactions between molecules, 
such as can be detected using, for example, a yeast two hybrid assay or by 
immuno P recipitat.on. The term "interact" encompasses 'binding' interactions between 
molecules. Interactions may be. for example, protein-protein. protein-nucte,c aad. proteuv 
small molecule or small molecule-nudeic acid in nature. 
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"Minimal promoter" refers to the minimal expression control element that is 
capable of initiating transcription of a selected DNA sequence to which it is operably tanked. 
A Lima, promoter frequently consists of a TATA box or TATA-like box. Numerous m,n,ma» 
promoter sequences are known in the literature. 

-Nucleic acid" refers to polynucleotides such as deoxyribonucleic acid (DNA), 
" and where appropriate, ribonucleic acid (RNA). The term should also be understood to 
include derivatives, variants and analogs of either RNA or DNA made from nucleot.de 
analogs, and, as applicable to the embodiment being described, single (sense or 
antisense) and double-stranded polynucleotides. 
10 "Operably linked" when referring to an expression control element and a cod.ng 

sequence means that the expression control element is associated with the coding 
sequence in such a manner as to permit or facilitate transcription of the coding sequence. 

A '•recombinant virus" is a virus particle in which the packaged nuclei acd 
contains a heterologous portion. 

-Protein" "polypeptide" and "peptide- are used interchangeably. 
A "target gene" is a nucleic acid of interest, the expression of which is modulated 
according to the methods of the invention. The target gene can be endogenous or 
exogenous and can integrate into a cell's genome, or remain episomal. The target gene can 
encode, for instance, a protein, an antisense RNA or a ribozyme. 
20 The terms "transcriptional activation unit" and "activation unit", refer to a 

peptide sequence which is capable of inducing or otherwise potentiating transition 
activator-dependent transcription, either on its own or when linked covalently or non- 
covalently to another transcriptional activation unit. An activation unit may contain a mmimal 
polypeptide sequence which retains the ability to interact directly or indirectly with a 
25 transcription factor. Unless otherwise dear from the context, where a 

referred to as "including" or -comprising- an activation unit, it will be understood that other 
portions of the protein from which the activation unit is derived can be included. 
Transcriptional activation units can be rich in certain amino acids. For example, a 
transcriptional activation unh can be a peptide rich in acidic residues, glutamine. proline, or 
30 serine and threonine residues. Other transcriptional activators can be rich in isoleucine or 
basic amino acid residues (see. e.g.. Triezenberg (1995) Cur. Opin. Gen. Develop. 5:190. 
and references cited therein). For instance, an activation unit can be a peptide motif of at 
least about 6 amino acid residues associated with a transcription activation domain 
including the well-known -acidic", "glutamine-rich" and -proline-rich" motifs such as the K13 
35 motif from p65. the OCT2 Q domain and the OCT2 P domain, respectively. 

The term "transcriptional activator" refers to a protein or protein complex, the 
presence of which can increase the level of gene transcription in a cell of ^sponsive, 
gene It is thought that a transcriptional activator is capable of enhancing the efficiency 
with which the basal transcription complex performs, i.e.. activating transcription. Thus, as 
40 used herein, a transcriptional activator can be a single protein or alternatively it can be 
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composed of several units at least some of which are not covalently linked to each other. 
A transcriptional activator typically has a modular structure, i.e.. comprises one or more 
component domains, such as a DNA binding domain and one or more transcnptional 
activation units or domains. Transcriptional activators are a subset of transcription factors, 
defined below. 

'Transcription factor" refers to any protein whose presence or absence 
contributes to the initiation of transcription but which is not itself a part of the polymerase. 
Certain transcription factors stimulate transcription (-transcriptional activators"); other 
repress transcription (transcriptional repressors"). Transcription factors are generally 
classifiable into two groups: (i) the general transcription factors, and («) the transcription 
activators. Transcription factors usually contain one or more regulatory domains. Some 
transcription factors contain a DNA binding domain, which is that part of the transcnpt.on 
factor which directly interacts with the expression control element of the target gene. 

"Transcription regulatory domain" denotes any domain which regulates 
transcription, and includes activation, synergizing and repression domains. The term 
-activation domain" denotes a domain, e.g. in a transcription factor, which pos,t.vely 
regulates (increases) the rate of gene transcription. The term -repression domain" denotes 
a domain which negatively regulates (inhibits or decreases) the rate of gene transcription. 

A " transcription synergizing domain" is defined as any domain which 
increases the potency of transcriptional activation when present along w.th the 
transcription activation domain. A synergizing domain can be an independent 
transcriptional activator, or alternatively, a domain which on its own does not .nduce (or 
does not usually induce) transcription but is able to potentiate the activity of a transition 
activation domain. The synergizing domain can be a component domain of a fus.on protein 
containing the activation domain or can be recruited to the DNA binding domain or other 
component of the transcription complex, e.g.. via a bundling interaction. 

Transf ection" means the introduction of a naked nucleic acid molecule into a 
recipient cell. "Infection" refers to the process wherein a nucleic acid is introduced into a 
cell by a virus containing that nucleic acid. A 'productive infection' refers to the process 
wherein a virus enters the celt, is replicated, and is then released from the cell (sometimes 
referred to as a 'lytic' infection). 'Transduction" encompasses the introduction of nucleic 
acid into cells by any means. 

-Transgene- refers to a nucleic acid sequence which has been introduced into a 
celt Daughter cells deriving from a cell in which a transgene has been introduced are also 
said to contain the transgene (unless it has been deleted). The polypeptide or RNA 
encoded by a transgene may be partly or entirely heterologous, i.e.. foreign, with respect 
to the animal or cell into which it is introduced. Alternatively, the transgene can be 
homologous to an endogenous gene of the transgenic animal or cell into wh.ch it is 
introduced, but is designed to be inserted, or is inserted, into the animal's genome in such a 
, way as to alter the genome of the cell into which it is inserted (e.g.. it ts inserted at a 
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,ocation which differs from that of the natural gene). A transgene can also be present ^ an 
episome A transgene can include one or more expression control elements and any other 
nucleic acid. (e.g. intron). that may be necessary or desirable for optimal express.on of a 
selected coding sequence. 

The term "vector" refers to a nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of vector is an episome. i.e.. a nucle.c 
acid capable of extra-chromosomal replication. Often vectors are used which are capable 
of autonomous replication and/or expression of nucleic acids to which they are linked. 
Vectors capable of directing the expression of an included gene operatively linked to an 
expression control sequence can be referred to as "expression vectors". Express.on 
vectors are typically in the form of "piasmids" which refer generally to circular double 
stranded DNA loops which, in their vector form are not bound to the chromosome. In the 
present specification, 'plasmid' and 'vector- are used interchangeably as the plasmid .s 
the most commonly used form of vector. However, the invention is intended to include such 
other forms of vectors which serve equivalent functions and which are or become known in 
the art. Viral vectors are nucleic acid molecules containing viral sequences which can be 
packaged into viral particles. 
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Bundling domains „ . ■ 

As described above, bundling domains interact with like doma.ns via protein-protein 

interactions to induce formation of protein bundles". Various order oligomers (dimers, 
trimers. tertramers. etc.) of proteins containing a bundling domain can be formed, depending 

on the choice of bundling domain. 

One example of a dimerization domain is the leucine zipper (LZ) element Leucrne 
zippers have been identified, generally, as stretches of about 35 amino acids «rt**io £ 
5 leucine residues separated from each other by six amino acids (Maniatis and Abel (1989) 
Nature 341 24-25). Exemplary leucine zippers occur in a variety of eukaryotic DNA 
binding proteins, such as GCN4, C/EBP. c-Fos, c-Jun. c-Myc and c-Max. Other 
dimerization domains include helix-loop-helix domains (Murre. C. et al. (1989) Cell 58:537- 
544) Dimerization domains may also be selected from other proteins, such as the retinoic 
acid receptor, the thyroid hormone receptor or other nuclear hormone receptors (Kurokawa 
et al (1993) Genes Dev. 7:1423-1435) or from the yeast transcription factors GAL4 and 
HAP1 (Marmonstein et al. (1992) Nature 356:408^14; Zhang et al. (1993) Proc. Natl. Acad. 
Sci. USA 90:2851-2855). Dimerization domains are further descnbed in U.S. Patent No. 
5,624,81 8 by Eisenman. 
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Of particular current interest are tetramer-forming bundling domains. Incorporation of 
such a tetramerization domain within a fusion protein leads to the constitutive assembly of 
tetramenc clusters orbundles. For example, a bundle of four activation units can be 
assembled by covaiently linking the activation unit to a tetramerizat.on domain. By 
clustering the activation units together through a bundling domain, four activation units can 
be delivered to a single DNA binding domain at the promoter. The E. coli lactose repressor 
tetramerization domain (amino acids 46-360; Chakerian et al. (1991) J. Biol. Chem. 
2661371 ; Alberti et al. (1993) EMBO J- 12:3227; and Lewis et al. (1996) Nature 
27V1247) illustrates this class. Furthermore, since the fusion proteins may contain more 
than one activation unit linked to the bundling domain, each of the four proteins of the 
tetramer can contain more than one activation unit (and the complex may compnse more 

than 4 activation units). 

Other illustrative tetramerization domains include those derived from residues 322- 
355 of P 53 (Wang et al. (1994) Mol. Cell. Biol. 14:5182; Clore et al. (1994) Science 
265 386) see also U.S. Pat No. 5.573.925 by Halazonetis. Other bundling domains can 
be derived from the Dimerization cefaclor of hepatocyte nuclear factor-1 (DCoH). DCoH 
associates with specific DNA binding proteins and also catalyses the dehydration of the 
biopterin cefaclor of phenylalanine hydroxylase. DCoH is a tetramer. See e.g. Endnzzi. 
J A Cronk, J.D., Wang. W.. Crabtree. G.R and Alber. T. (1995) Science 268. 556559; 
Suck and Rcner (1996) FEBS Lett 389(1 ):3-39; Standmann, Senkel and Ryffel (1998) Int J 

DevBiol 42(1):53-59 

The bundling domain may comprise a naturally-occurring peptide sequence or a 
modified or artificial peptide sequence. Sequence modifications in the bundling domain may 
be used to increase the stability of bundle formation or to help avoid unintended bundling 
with native protein molecules in the engineered cells which contain a wild-type bundling 
domain. 

For example sequence substitutions that stabilize oligomerization driven by leucine 
zippers are known (Krylov et al. (1994) cited above; OShea et al. (1992) cited above). 
To illustrate, residues 174 or 175 of human p53 may be replaced by glutamine or leucine, 
respectively. 

To illustrate sequence modifications aimed at avoiding unintended bundling with 
endogenous protein molecules, the p53 tetramerization domain may be modified to reduce 
the likelihood of bundling with endogenous P 53 proteins that have a wild-type p53 
tetramerization domain, such as wild-type P 53 or tumor-derived p53 mutants. Such 
altered p53 tetramerization domains are described in U.S. Pat. No. 5.573.925 by 
Halazonetis and are characterized by disruption of the native p53 tetramerization doma.n 
and insertion of a heterologous bundling domain in a way that preserves tetramenzation. 
Disruption of the P 53 tetramerization domain involving residues 335-348. or a subset of 
these residues, sufficiently disrupts the function of this domain so that it can no longer dnve 
5 tetramerization with wild-type p53 or tumor-derived p53 mutants. At the same time. 
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however, introduction of a heteroiogous dimerization domain reestablishes the ability to 
form tetramers. which is mediated both by the heterologous dimenzat,on doma,n and by the 
residual portion of the p53 tetramerization domain sequence. 

Other suitable bundling domains can be readily selected or des.gned by the 
5 practitioner, including semi-artificial bundling domains, such as variants of the GCN4 leuane 
5 * PP er that form tetramers (Alberti et aL (1993) ENIBO J. 12:3227-32 3B; J Harbury et a . 

(1993) Science 262:1401-1407; Krylov et al. (1994) (1994) EMBO J. 1 3:2849-2861 > The 
tetrameric variant of GCN4 leucine zipper described in Harbury et al. (1993). supra, has 
isoleucines at positions d of the coiled coil and leucines at positions a. in contrast to the 
,o original zipper which has leucines and valines, respectively. 

The choice of bundling domain can be based, at least in part, on the des.red 
conformation of the bundles. For instance, the GCN4 leucine zipper drives parallel subunrt 
assembly [Harbury et al. (1993). cited above], while the native P 53 tetramerization 
domain drives antiparallel assembly [Clore et al. (1994) cited above; Sakamoto et al. 
15 (1994) Proc. Natl. Acad. Sci. USA 91:8974-8978]. - 

in addition, a variety of techn.ques are available for identifying other naturally 
occurring bundling domains, as well as for selecting bundling domains derived from mutant 
otherwise artificial sequences. See. for example. Zeng et al. (1997) Gene 185245. 
O'Shea et al. (1 992) Cell 68:699-708; Krylov et al. [cited above]. 
20 In applications of the invention involving the genetic engineering of cells wth.n (or 

tor use within) whole animals, the use of peptide sequence derived from that spec.es js 
preferred when possible. For instance, for applications involving human gene therapy, use 
of bundling domains derived from human proteins may minimize the risk of immunogen.c 
reactions. However, in some cases the use of bundling domains of human ong.n may 
25 induce interactions between the fusion proteins and the endogenous protein from wh.ch the 
bundling domain was derived, i.e.. Iead.ng to unwanted bundling of fus.on prote.ns wrth the 
endogenous protein containing the identical bundling domain. Such interactions, in addton 
to inhibiting target gene expression, may also have other adverse effects in the cell. e.g.. 
by interfering with the function of the endogenous protein from which the bundling doma.n 

30 was derived. . 

Approaches for avoiding unwanted bundling of fusion proteins of this .nvent.cn 
with endogenous proteins include using a bundling domain which is (a) heterologous to the 
host organism, (b) expressed by the host organism but only (or P redominant, ^ n t ^'^ r 
tissues other than those which will express the fusion proteins, or (c) eng.neered through 
35 modification in peptide sequence such that it bundles preferentially with Itself rather than 
with an endogenous bundling domain. 

The first approach is illustrated by the use of a bacterial lac repressor 
tetramerization domain in human cells. 

The second approach requires the use of a bundling domain denved from a prote.n 
40 which is not expressed in the cells or tissues which are to be engineered to express the 
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fusion protein(s) ot this invention, at least not at a level which would cause undue 
interference with the bundling application or with normal cell function. Fusion protems 
containing a bundling domain derived from an endogenous protein expressed select.vely or 
IfelentLy in one tissue could b expressed in a different tissue without any adverse 
effects. For example, to regulate gene expression in human muscle, fusion prote.ns 
containing bundling domains from a protein expressed in liver, brain or some other tissue or 
tissues-but not in musde-can be expressed in musde cells without undue nsk of 
mismatched bundling. 

in the third approach, and as noted previousty. the binding spec.fic.ty of the 
bundling domain is engineered by alterations in peptide sequence to replace (in whole or 
part) bundling activity for proteins containing the wild-type bundling domain with bundl.ng 
activity for proteins containing the modified peptide sequence. 

Several examples of tissue-specific bundling domains which could be used in the 
practice of this invention include bundling domains derived from the Retinoid X receptor. 
(Kersten. S.. Reczek. P.R and N. Noy (1997) J. Biol. Chem. 272. 29759-29768); 
Dopamine D3 receptor (Nimchinsky. EA, Hot. P.R.. Janssen. W.G.M.. Morrison. J.H and 
C Schmauss (1997) J. Biol. Chem. 272. 29229-29237); Butyrylcholinesterase (Biong. 
RM Bedows.EandO.Lockridge(1997)Biochem.J.327.747-757); Tyrosme 
Hydroxylase (Goodwill. K.E.. Sabatier. C. Marks. C. Raag. R.. Rtzpatrick. Pfand^ 
Stevens (1997) Nat. Struct. Biol 7. 578-585). Bcr (McWhirter. J.R.. Galasso. D.L and J.Y. 
Wang (1993) Mol. Cell. Biol. 13. 7587-7595); and Apolipoprotein E (Westerlund, J.A and 
K.H. Weisgraber (1993) J. Biol. Chem. 268. 15745-15750). 

Transcription Activation Domains / Activation Units 

Transcription activation domains and activation units can comprise naturally- 
occurring or non-naturally-occurring peptide sequence so fong as they are capable of 
activating or potentiating transcription of a target gene construct. A variety of polypepfides 
and polypeptide sequences which can activate or potentiate transcription in eukaryotic 
cells are known and in many cases have been shown to retain their activation function 
when expressed as a component of a fusion protein. An activation unit is generally at 
least 6 amino acids, and preferably contains no more than about 300 amino add res.dues. 
more preferably less than 200. or even less than 100 residues. 

Naturally occurring activation units include portions of transcription factors, such as 
a thirty amino acid sequence trom the C-terminus of VP1 6 (amino acids 461 -490). referred 
to herein as "Vc". Other activation units are derived from naturally occurring pepfides. For 
example the replacement of one amino acid of a naturally occumng activaton un.t by 
another may further increase activation. An example of such an activation unrt ,s a 
derivative of an eight amino acid peptide of VP16. the derivative having the am.no aad 
sequence DFDLDMLG. Other activation units are -synthetic" or -artificial" in that they are 
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not derived Irom a naturaUy occurring sequence. It is known, for example, that certa-n 
random alignments of acidic amino adds are capable of activating transection 

Certain transcription factors are known to be active only m speofic ce types ..e., 
they activate transcription in a tissue specific manner. By using activate umts wh,ch 
Lotion selectively or preferentially in specific ce..s, it is possible to des,gn a transcnpfona. 
activator of the invention having a desired tissue specificity. 

One source of peptide sequence for use in a fusion protein of th,s .nvent.cn ,s the 
herpes implex virus virion protein 1 6 (referred to herein as VP1 6. the amino -d seance 
o. Ich is disclosed in Triezenberg. SJ. et al. (1988) Genes Dev. 2:718-729). For 
example, an activation unit corresponding to about 127 of the C-term-na. ammo acds of 
VP1 6 can be used. Alternatively, at least one copy of about 1 1 amino acds from the C - 
terminal region of VP16 which retains transcription activation ab...ty ,s used as an acfcvat.on 
unit. Preferably, an oligomer comprising two or more copies of this .sequence • used. 
Suitable C-terminal peptide portions of VP16 include those descnbed .n Se,pel. K. et al. 

(EMBO J. (1992) 13:4961-4968). 

Another example of an acidic activation unit is provided ,n rescues 753-881 of 

GAU ' One particularly important source of transcription activation units is the (human) NF- 
kB subunit p65. The activation domain may contain one or more copies of a pepude 
sequence comprising al. or part of the P 65 sequence spann.ng res.dues 450-550 or a 
peptide sequence denved therefrom. In certain embodiments, it has beer 
extending the P 65 peptide sequence to include sequence spann.ng P 65 res,due 361 -450. 
e.g.. including the "AP activation unif. .eads to an unexpected increase «J™™* °" 
activation. Moreover, a peptide sequence comprising ail or a portion of P<^1-550) or 
peptide sequence derived therefrom, in combination with heterologous act^at-on unrts. can 
y.eld surprising additional increases in the level of transcription action. P*^»* 
actuation domains .unction across a broad range of promoters and .n a number of bundhng 
experiments have yie.ded increases in transcription levels of chro ™ S ^^^ed 
target genes six-fold, eight-fold and even 14-15-fo.d higher than obta,ned wth unbundled 
tandem copies of VP1 6 which itself is widely recognized as a very potent act,vat,on 

domain . - . 

It is expected that recombinant DNA molecules encoding fusion proteins wh.ch 
contain a P 65 activation unit, or peptide sequence derived therefrom, will provide significant 
advantages for heterologous gene expression in its various contexts, .nclud.ng d.menza.on 
based regulated systems such as described in Internationa, patent appl.catons 
PCTAJS94/01617. PCT/US95/10591. PCT/US96/09948 and the l.ke. as we., as ,n other 
heterologous transcription systems induding a.lostery-based regutation such as those 
involving tetracydine based regulation reported by Bujard et al. and those .nvolvng 
steroid or other hormone-based regulation. 
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One class of p65-based transcription factors contain more than one copy of a p65- 
derived domain. Such proteins will typically contain two or more, gen rally up to about s.x. 
copies of a peptide sequence comprising all or a portion of p65(361-550). or pept.de 
sequence derived therefrom. Such iterated p65-based transcription activation domains are 
useful both in bundled and non-bundled approaches. 

Other polypeptides with transcription activation activity in eukaryotic cells can be 
used to provide activation units for the fusion proteins of this invention. Transcription 
activation domains found within various proteins have been grouped into categones based 
upon shared structural features. Types of transcription activation domains include aad.c 
transcription activation domains (noted previously), proline-rich transcription act.vat.on 
domains serine/threonine-rich transcription activation domains and glutam.ne-r.ch 
transcription activation domains. Examples of proline-rich activation domains include am.no 
acid residues 39JM99 of CTF/NF1 and amino acid residues 31-76 of AP2. Examples of 
serine/threonine-rich transcription activation domains include amino acid residues 1 -427 of 
ITF1 and amino acid residues 2-451 of ITF2. Examples of glutamine-rich act.vat.on 
domains include amino acid residues 175-269 of Oct1 and amino acid residues 132-243 of 
Sp1 The amino acid sequences of each of the above described regions, and of other 
useful transcription activation domains, are disclosed in Seipel, K. et al. (EMBO J. (1992) 
13:4961-4968). 

Still other illustrative activation domains and motifs of human ongin include tne 
activation domain of human CTF. the 1 8 amino acid (NFLQLPQQTQGALLTSQP) 
glutamine rich region of Oct-2. the N-terminal 72 amino acids of P 53. the SYGQQS repeat 
in Ewing sarcoma gene and an 1 1 amino acid (535-545) acidic rich region of Rel A protem. 

In addition to previously described transcription activation domains, novel 
transcription activation units, which can be identified by standard techniques, are within the 
scope of the invention. The transcription activation ability of a polypept.de can be 
assayed by linking the polypeptide to a DNA binding domainand determining the amount 
of transcription of a target sequence that is stimulated by the fusion protein. For example, 
a standard assay used in the art utilizes a fusion protein of a putative activation unit and a 
GAL4 DNA binding domain (e.g.. amino acid residues 1-93). This fusion prote.n .s then 
used to stimulate expression of a reporter gene linked to GAL4 binding sites (see e.g.. 
Seipel. K. et al. (1992) EMBO J. 1 1 :4961 -4968 and references cited therein). 

' The activation domains of the invention can be from any eukaryotic species 
(including but not limited to various yeast species and various vertebrate spec.es. 
including the mammals), and it is not necessary that every activation unit or domain be from 
the same species. In applications of this invention to whole organisms, it is often 
preferable to use activation units and activation domains from the same species as the 
recipient to avoid immune reactions against the fusion proteins. 
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Mutations in the Activation Domain 

One way to increase the potency of an activation domain is to increase rts acidic or 
hydrophobic content through modifications in peptide sequence. Acidic amino acids which 
can increase potency of activation domains include aspartic acid and glutamic aad. In some 
cases one may want to riPcrease (usually only modestly) the potency of the activation 
domain in order to obtain a less steep activation curve, especially if a greater number of 
individually weaker activation domains will be deployed together, e.g.. by bundling. 

Thus in one embodiment of this invention, mutations are introduced into the 
activation domain by standard techniques known in the art, such as site^irected PCR 
based mutagenesis. In this embodiment, one to five, in some cases one to three, 
alterations in peptide sequence can be introduced into the DMA coding for the action 
domain Each of these mutations either alone or in combination with one or more other 
mutations may be readily assayed for its ability to induce the transcription of either 
transiently transfected or stably integrated target reporter gene constructs. For instance, a 
construct encoding a fusion protein containing multiple copies of the modified sequence and 
a DNA binding domain can be introduced into cells and the activity of the encoded fusion 
protein measured in transcription assays (with a responsive reporter gene construct) and 
compared to analogous fusion proteins containing wild-type activation domain sequence or 

a different mutation of interest. 

The foregoing is illustrated in the case of the P 65 transcription activation doma.ns. 
Constructs are prepared encoding fusion proteins containing one or more p65 transcnption 
activation domains and a DNA binding domain. The p65 domains may be wild-type (as a 
control) or may contain any of a variety of alterations in peptide sequence. These 
mutations can generally be introduced into a variety of p65Kferived transcription actrvation 
domains. For example. M1 mutations can be introduced into plasrnids carrying p65 
activation domain coding regions between amino acids 533 and 550. or 361 and 550. or 

280 and 550. . , 

Exemplary mutations for p65 transcription activation domains include those 

intended to increase the potency of the p65 activation domain (including the M1 . M2 

M6 M7 and M8 mutations) and those intended to decrease the potency (generally slightly) 

of the activation domain. The p55 activation domain contains four phenylalanine residues 

and mutations that convert these residues to alanine has been shown to significantly 

reduce the potency of the p65 activation domain in yeast and in vitro experiments. Our 

experiments show that changing F 533 and F 541 to alanine residues reduced the potency 

of P 65 activation domain to half of wild type level. Mutations of the M4 and M5 class 

change the conserved serine and proline residues between amino-acids 361 and 450. Our 

data show that M4 and M5 mutant sequences can induce the expression of target genes 

synergistically when fused to other acidic type activation domains. In GST pull down 

assays the region of the M4 and M5 mutations interacts with TFIIA. Although M4 and M5 

mutations individually have a very small effect on the ability of P 65 activation domain to 
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induce the reporter gene, combined together, they significantly reduce its potency. Thus, 
mutations for the practitioner to bear in mind include, but are not limited to, the following: 

WT: 532-DFSStADMDFSALLSQIS 
5 M1 : 532-DFSDFADMDFDADLSQIS 

WT: 439-ALLQLQFDDED 
M2: 439-ALLDLDFDDED 
WT: 529-QDEDFSSIADMDFSALLSQI 
M3: 529-GDEDASSIADMDASALLSQI 
10 WT: 377-SALALPAPPQVL 

M4: 377-GALALGAGGQVL 
WT: 401-SALAQAPAPVP 
M5: 401 -GALAQAGAGVG 
WT: 434-GTLSEALLQLQFD 
15 M6: 434-GDFS-ALLQLQFD 

WT: 472-SEFQQLLNQ 
M7: 472-SEFSALLNQ 
WT: 472-SEFQQLLNQ 
M8: 472-SDFQQLLNQ 
20 WT: 530-DEDFSSIADMDFS 

M9: 530-DEDFSSLLDMDFS 

Synergizing Domains 

A synergizing domain is any domain which observably increases the potency of 
25 transcription activation when recruited to the promoter aiong with the transcription act.vat.on 
domain. A synergizing domain can be an independent transcription activation doma.n or an 
activation unit which on its own does not induce transcription bat is able to P*""™. *» 
activity of a transcription activation domain with which it is linked covalently (i.e.. wrth.n the 
same fusion protein) or with which K is associated non^ovalently (e.g.. through bundbng or 
30 ligand-mediated clustering). 

One example of a synergizing domain is the so-called "alanine/prol.ne nch or AP" 
activation motif of P 65. which extends from about amino acids 361 to about amino ac.d 450 
of that protein. Similar AP activation motifs are also present in, e.g.. the p53 and CTF 
proteins. The presence of one or several copies of an AP domain alone in a prote.n does 
35 not itself provide the ability to induce activator-dependent transcription act.vat.on. 

However, when linked to activation units which are themselves capable of inducng some 
level of activator-dependent transcription, e.g., another portion of P 65 or VP16. the AP 
activation unit synergizes with the second activation domain to induce an increase .n the 
level of transcription. 
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Accordingly, the invention provides an AP activation unit, functional derivatrve 
thereof or other synergizing domain which on its own is incapable of activating 
transcription. Functional alternative sequences for use as synergizing domains, .nclud.ng 
among others derivatives of an AP activation unit, can be obtained, for instance, by 
5 screening candidate sequences for binding to TFIIA and measuring transcriptional actrnty m 
a co-transfection assay. Such equivalents are expected to include forms of the activation 
unit which are truncated at either the N-terminus or C-terminus or both. e.g.. fragments of 
P 65 (or homologous sequences thereto) which are about 75, 60. 50. 30 or even 20 am.no 
acid residues in length (e.g., ranging in length from 20-89 amino acids). Ukew.se, rt .s 
,o expected that the AP activation unit sequence from P 65 can tolerate ammo aad 

substitutions, e.g.. to produce AP motifs of at least 95%. 90%. 80% and even 70% 
identity with the AP activation unit sequence of SEQ ID No. 2 of USSN 08/918.401. 
These and other AP derivatives include, for example. AP domains based on naturally- 
occurring sequence but modified by the replacement, insertion or deletion of 1 , 2, 3. 4 or 5 

15 amino acid residues. 

Other synergizing domains are independent activation doma.ns, e.g. VP1 6. While 
VP16 can activate transcription on its own, it can synergize with p65 to produce levels of 
transcription that are greater than the sum of the transcription levels effected by each 
activation domain alone. As shown in the examples, fusion of VP1 6 to a nucleic add 

2 o containing an FRB domain, a lac repressor tetramerization domam and p65 greatly 

increases the level of expression of a target gene as compared to the same construct .n the 
absence of VP16. 

Synergizing domains may also be fused to an unbundled or bundled DNA b.nd.ng 
domain To avoid the activation of transcription in a constitutive manner with constructs 
25 such as these, it is preferable that the synergizing domain itself be incapable of activating 
transcription. 
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Ugand binding domains 

Fusion proteins containing a ligand binding domain for use in practicing this 
invention can function through one of a variety of molecular mechanisms. 

In certain embodiments, the ligand binding domain permits ligand-mediated cross- 
linking of the fusion protein molecules bearing appropriate ligand binding doma.ns. In these 
cases the ligand is at least divalent and functions as a dimerizing agent by b.nd.ng to the 
two fusion proteins and forming a cross-linked heterodimeric complex which actuates target 
gene expression. See e.g. WO 94/18317. WO 96/20951. WO 96/06097. WO 97/31898 
and WO 96/41865. 

In other embodiments, the binding of ligand to fusion protein is thought to result .n 
an allosteric change in the protein leading to the binding of the fusion protem to a target 
DNA sequence [see e.g. US 5.654.168 and 5.650.298 (tet systems), and WO 93/23431 
and WO 98/18925 (RU486-based systems)! or to another protein which b.nds to the 
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target DNA sequence [see e.g. WO 96/37609 and WO 97/38117 (ecdysone/RXR-based 
systems)], in either case, modulating target gen expression. 

Dimerization-based systems 

In the cross-linking-based dimerization systems the fusion proteins can contain one 
or more ligand binding domains (in some cases containing two. three or four such domains) 
and can further contain one or more additional domains, heterologous with respect to the 
ligand binding domain, including e.g. a DNA bintfmg domain, transcription activate daman. 

In general, any ligand/ligand binding domain pair may be used in such systems. For 
example, ligand binding domains may be derived from an immunophilin such as an FKBP. 
cyclophilin, FRB domain, hormone receptor protein, antibody, etc., so long as a l.gand is 
known or can be identified for the ligand binding domain. 

For the most part, the receptor domains will be at least about 50 amino adds, and 
fewer than about 350 amino acids, usually fewer than 200 amino acids, either as the 
natural domain or truncated active portion thereof. Preferably the binding domain will be 
small «25 kDa to allow efficient transection in viral vectors), monomeric. nonimmunogemc. 
and should have synthetically accessible, cell permeant, nontoxic ligands as described 
above 

'preferably the ligand binding domain is for (i.e.. binds to) a ligand which is not itself 
a gene product (i.e.. is not a protein), has a molecular weight of less than about 5 kD and 
preferably less than about 2.5 kD. and is cell permeant. In many cases it will be preferred 
that the ligand does not have an intrinsic pharmacologic activity or toxicity which interferes 
with its use as a transcription regulator. 

The DNA sequence encoding the ligand binding domain can be subjected to 
mutagenesis for a variety of reasons. The mutagenized ligand binding domain can prov.de 
for higher binding affinity, allow for discrimination by a ligand between the mutant and 
naturally occurring forms of the ligand binding domain, provide opportunities to design 
ligand-tigand binding domain pairs, or the like. The change in the ligand binding domain can 
involve directed changes in amino acids known to be involved in ligand binding or with 
ligand-dependent conformational changes. Alternatively, one may employ random 
mutagenesis using combinatorial techniques. In either event, the mutant ligand binding 
domain can be expressed in an appropriate prokaryotic or eukarotic host and then 
screened for desired ligand binding or conformational properties. Examples involving FKBP. 
cyclophilin and FRB domains are disclosed in detail in WO 94/18317. WO 96/06097 WO 
97/31898 and WO 96/41865. For instance, one can change Phe36 to Ala and/or Asp37 to 
Gly or Ala in FKBP12 to accommodate a substituent at positions 9 or 10 of the ligand 
FK506 or FK520 or analogs, mimics, dimers or other derivatives thereof. In particular, mutant 
FKBP12 domains which contain Val. Ala, Gly, Met or other small amino ackls in place of one 
or more of Tyr26. Phe36. Asp37. Tyr82 and Phe99 are of particular interest as receptor 
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domains for FK506-type and FK-520-type ligands containing modifications at C9 and/or 
C1 0 and their synthetic counterparts (see e.g., WO 97/31 898). Illustrative mutations of 
current interest in FKBP domains also include the following: 



F35A 


Y26V _J 


F46A 


W59A 


F36V 


Y26S 


F48H 


H87W 


F36M 


D37A 


F48L 


H87R 


F36S 


I90A 


F48A 


F36V/F99A 


F99A 


I91A 


E54A/F36V/F99G 


F99G 


F46H 


I E54K/F36M/F99A 


Y26A 


F46L 


V55A | F36M/F99G 







Table i: Entries identify the native amino acid by singleletter age g^SSXSSS^ 
followed by the replacement amino acid in the mutant. Thus. P36V des '9nates a numan 
PKRP12 seauence in which phenylalanine at position 36 is replaced by vahne. 
^SSftTSSSte! ^a double mutation in which phenylalanine at posrt.ons 36 and 99 are 
replacedby valine and alanine, respectively. 



Illustrative examples of domains which bind to the FKBP:rapamycin complex 
("FRBs") are those which include an approximately 89-amino acid sequence containing 
residues 2025-21 1 3 of human FRAP. Another FRAP-derived sequence of interest 
comprises a 93 amino acid sequence consisting of amino acids 2024-21 13. Similar 
considerations apply to the generation of mutant FRAP-derived domains which bind 
preferentially to FKBP complexes with rapamycin analogs (rapalogs) containing 
modifications (i.e.. are 'bumped') relative to rapamycin in the FRAP-binding portion of the 
drug. For example, one may obtain preferential binding using rapalogs bearing 
substituents other than -OMe at the C7 position with FRBs based on the human FRAP 
FRB peptide sequence but bearing amino acid substitutions for one of more of the residues 
Tyr2038 Phe2039. Thr2098, Gln2099. Trp2101 and Asp21 02. Exemplary mutations 
include Y2038H, Y2038L. Y2038V. Y2038A. F2039H. F2039L. F2039A. F2039V. D2102A, 
T2098A T2098N. T2098L. and T2098S. Rapalogs bearing substituents other than -OH at 
C28 and/or substituents other than =0 at C30 may be used to obtain preferential binding 
to FRAP proteins bearing an amino acid substitution for Glu2032. Exemplary mutations 
include E2032A and E2032S. Proteins comprising an FRB containing one or more amino 
add replacements at the foregoing positions, libraries of proteins or peptides randomized at 
those positions (i.e., containing various substituted amino acids at those residues), 
libraries randomizing the entire protein domain, or combinations of these sets of mutants are 
made using the procedures described above to identify mutant FRAPs that bind 
preferentially to bumped rapalogs. 
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Other macrolide binding domains useful in the present invention, including mutants 
thereof are described in the art. See. for example. W096/41865. W096/13613. 
WO96/06111. WO96/06110. WO96/06097. W096/12796. WO95/05389. WO95/02684. 
W094/18317. 

The ability to employ in vitro mutagenesis or combinatorial modifications of 
sequences encoding proteins allows for the production of libraries of proteins which can be 
screened for binding affinity for different ligands. For example, one can randomize a 
sequence of 1 to 5. 5 to 10. or 10 or more codons. at one or more sites in a DNA sequence 
encodmg a binding protein, make an expression construct and introduce the expression 
construct into a unicellular microorganism, and develop a library of modified sequences. 
One can then screen the library for binding affinity of the encoded polypeptides to one or 
more ligands. The best affinity sequences which are compatible with the cells into which 
they would be introduced can then be used as the ligand binding domain for a given ligand. 
The ligand may be evaluated with the desired host cells to determine the level of binding of 
the ligand to endogenous proteins. A binding profile may be detemined tor each such 
ligand which compares ligand binding affinity for the modified ligand binding domain to the 
affinity for endogenous proteins. Those ligands which have the best binding profile could 
then be used as the ligand. Phage display techniques, as a non-limiting example, can be 
used in carrying out the foregoing. 

In other embodiments, antibody subunits, e.g. heavy or light chain, particularly 
fragments, more particularly all or part of the variable region, or single chain antibodies, can 
be used as the ligand binding domain. Antibodies can be prepared against haptens wh«ch 
are pharmaceutical^ acceptable and the individual antibody subunits screened for binding 
affinity. cDNA encoding the antibody subunits can be isolated and modified by deletion of 
the constant region, portions of the variable region, mutagenesis of the variable region, or 
the like, to obtain a binding protein domain that has the appropriate affinity for the ligand. 
In this way. almost any physiologically acceptable hapten can be employed as the ligand. 
Instead of antibody units, natural receptors can be employed, especially where the 
binding domain is known. In some embodiments of the invention, a fusion protein 
comprises more than one ligand binding domain. For example, a DNA binding domain can 
be linked to 2. 3 or 4 or more ligand binding domains. The presence of multiple ligand 
binding domains means that ligand-mediated cross-linking can recruit multiple fusion 
proteins containing transcription activation domains to the DNA binding domain-containing 
fusion protein. 

Allostery-based systems 

As mentioned previously, systems for transcription regulation based on ligand- 
dependent allosteric changes in a chimeric transcription factor are also useful in practicing 
the subject invention. One such system employs a deletion mutant of the human 
progesterone receptor which no longer binds progesterone or other endogenous steroids 
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but can be activated by the orally active progesterone antagonist RU486, described, e.g. 
in Wang et al. (1994) Proc. Natl. Acad. Sci. USA. 91 :8180. Activation was demonstrated 
in cells transplanted into mice using doses of RU486 (5-50 g/kg) considerably below the 
usual dose for inducing abortion in humans (10 mg/kg). However, the reported induction 
ratio in culture and in animals was rather low. 

Another such system is the ecdysone inducible system. Early work demonstrated 
that fusing the Drosophila steroid ecdysone (Ec) receptor (EcR) Ec- binding domain to 
heterologous DNA binding and activation domains, such as E. coli lexA and herpesvirus 
VP16 permits ecdysone-dependent activation of target genes downstream of appropnate 
binding sites (Christopherson et al. (1992) Proc. Natl. Acad. Sci. USA. 89:6314). An 
improved ecdysone regulation system has been developed, using the DNA b.nd.ng 
domain of the EcR itself. In this system, the regulating transcription factor is provided as 
two proteins: (1) a truncated, mutant EcR fused to herpes VP16 and (2) the mammalian 
homolog (RXR) of Ultraspiracle protein (USP). which heterodimerizes with the EcR (No et 
al (1996) Proc. Natl. Acad. Sci. USA 93:3346). In this system, because the DNA 
binding domain was also recognized by a human receptor (the human famesoid X 
receptor), it was altered to a site recognized only by the mutant EcR. Thus, the invention 
provides an ecdysone inducible system, in which a truncated mutant EcR is fused to at 
least one subunit of a transcription activator of the invention. The transcription factor further 
comprises USP. thereby providing high level induction of transcription of a target 
genehaving the EcR target sequence, dependent on the presence of ecdysone. 

In another approach, the inducible system comprises or is derived from the 
E coli tet repressor (TetR), which binds to tet operator (tetO) sequences upstream of 
target genes. In the presence of tetracycline, or a tetracycline analog which bind to tetR. 
DNA binding is abolished and thus transaction is abolished. This system, in wh.ch the 
TetR had previously been linked to transcription activation domains, e.g. from VP16, is 
generally referred to as an allosteric 'off-switch" described by Gossen and Bujard (Proc. 
Natl Acad. Sci. U.S.A. (1992) 89:5547) and in U.S. Patents 5.464.758; 5.650598: and 
5.589.362 by Bujard et al. Target gene expression is reportedly regulatable over several 
orders of magnitude in a reversible manner. This system is said to provide low 
background and relatively high target gene expression in the absence of tetracycline or an 
analog The invention described herein provides a method for obtaining even stronger 
transcription induction of a target gene, which is regulatable by the tetracycline system or 
other inducible DNA binding domain. 

In some embodiments, a "reverse" Tet system is used, again based on a DNA 
binding domain that is a mutant of the E. coli TetR. but which binds to TetO in the 
presence of Tet Additional information on mutated tetR-based systems is prov.ded above 
and in patent documents cited previously. The use of bundling as descnbed herein 
provides a method for obtaining even stronger transcription induction of a target gene in the 
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presence of tetracycline or an analog thereof from a very low background in the absence of 
tetracycline. 

A tetR domain useful in the practice of this invention may compnse a naturally 
occurring peptide sequence of a tetR of any of the various classes (e.g. class A, B. C. D 
or E) (in which case the absence of the ligand stimulates target gene transcnption). or more 
preferably, comprises a mutated tetR which is derived from a naturally occurring sequence 
from which it differs by at least one amino acid substitution, addition or deletion. Of 
particular interest are those mutated tetR domains in which the presence of the hgand 
stimulates binding to the TetO sequence, usually to induce target gene transcnpt.on ,n a 
cell engineered in accordance with this invention. For example, mutated tetR domains 
include mutated Tn10-derived tetR domains having an amino acid substitution at one or 
more of amino acid positions 71 . 95. 101 and 102. By way of further illustration, one 
mutated tetR comprises amino acids 1 - 207 of the Tn10 tetR in which glutam.c ac,d 71 it 
changed to lysine, aspartic acid 95 is changed to asparagine. leucine 101 is changed to 
serine and glycine 102 is changed to aspartic acid. Ligands include tetracycline and a w.de 
variety of analogs and mimics of tetracycline, including for example, anhydrotetracycline 
and doxycycline. Target gene constructs in these embodiments contain a target 9*™* 
operably liriked to an expression control sequence including one or more copies of a DNA 
sequence recognized by the tetR of interest, including for example, an upstream activator 
sequence for the appropriate tet operator. See e.g. US Patent No. 5.654.168. 

Ligands of the invention 

In various embodiments where a ligand binding domain for the ligand is 
endogenous to the cells to be engineered, it is often desirable to alter the peptide 
sequence of the ligand binding domain and to use a ligand which discriminates between the 
endogenous and engineered ligand binding domains. Such a ligand should b.nd 
preferentially to the engineered ligand binding domain relative to a naturally occumng 
peptide sequence, e.g., from which the modified domain was derived. Th.s approach can 
avoid untoward intrinsic activities of the ligand. Significant guidance and illustrative 
examples toward that end are provided in the various references cited herein. 

Cross-linking/dimerization systems 

Any ligand for which a binding protein or ligand binding domain is known or can be 
identified may be used in combination with such a ligand binding domain in carrying out this 

invention. . ,. , 

Extensive guidance and examples are provided in WO 94/18317 for l.gands and 
other components useful for cross-linked oligomerization-based systems. Systems based 
on ligands for an immunophilin such as FKBP. a cyclophilin. and/or FRB doma.n are of 
special interesL Illustrative examples of ligand binding domain/ligand pairs that may be 
used for cross-linking indude, but are not limited to: FKBP/FK1012 . FKBP/synthetic 
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divalent FKBP ligands (see WO 96/06097 and WO 97/31 898). FRB/rapamycin or analogs 
thereof FKBP (see e.g., WO 93/33052. WO 96/41865 and Rivera et at. "A humanized 
system for pharmacologic control of gene expression". Nature Medicine 2(9 ^-lOSS 
(1997)) cydophi.in/cyclosporin (see e.g. WO 94/18317). FKBP/FKCsA/cycloph.hn (see 
e g Belshaw et al. 1996. PNAS 93:4604-4607). DHFR/methotrexate (see e.g. Lictra et al. 
1996 Proc Natl. Acad. Sci. USA 93:12817-12821), and DNA gyrase/coumermycin (see 
e g Farrar et al, 1996. Nature 383:178-181). Numerous variations and modrfications to 
ligands and ligand binding domains, as well as methodologies for designing, selecting 
and/or characterizing them, which may be adapted to the present invention are disclosed .n 
the cited references. 

Allostery-based systems ^,„ tf , ic 
For additional guidance on ligands for other systems which may be adapted to th.s 
invention, see e.g. (Gossen and Bujard Proc. Natl. Acad. Sci. U.SA 1992 89:5547. and 
US Patent Nos. 5654168, 5650298, 5589362 and 5464758 (TetRrtetracycline). Wang et al. 
1994 Proc. Natl. Acad. Sci. USA 91:8180-8184 (progesterone receptor/RU486). and No et 
al 1996 Proc. Natl. Acad. Sci. USA 93:3346-3351 (ecdysone receptor/ecdysone). 
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DNA-binding domains 

Regulated expression systems relevant to this invention involve the use of a 
protein containing a DNA binding domain to selectively target a desired gene for 
expression (or repression). Systems based on ligand-mediated cross-linking generally rely 
upon a fusion protein containing the DNA binding domain together with one or more l.gand 
binding domains. One genera, advantage of such systems is that they are particularly 
modular in nature and lend themselves to a wide variety of design choices. These systems 
permit wide latitude-in the choice of DNA binding domains. Many allostery-based 
systems, like the TetR- and P rogesterone-R-based systems, use a fusion prote.n 
containing a DNA binding domain together with a transcription regulatory domam (e^g. a 
transcription activation or repression domain). Some allostery-based systems such as the 
ecdysone-regulated system, use a protein like RXR which contains a DNA binding domam 
together with a binding site for another protein (such as the ecdysone receptor)..Of the 
allostery-based systems, the progesterone receptor-based system and like systems 
permit relatively greater latitude in the choice of DNA binding domain. While allostery-based 
systems like the TetR- and ecdysone receptor type may be engineered at the DN A 
binding domain, they are somewhat less amenable to ready replacement of the DNA 

binding domain. 

Various DNA binding domains may be incorporated into the des.gn of fusion 
proteins of this invention, especially those of the ligand-mediated cross-linking type and 
the progesterone-R-based type, so long as a corresponding DNA "recognition sequence 
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is known or can be identified, to which the domain is capable of binding. One or more 
copies of the recognition sequence are incorporated into, or present within, the express.cn 
control sequence of the target gene construct. Peptide sequence of human origin is often 
preferred, where available, for uses in human gene therapy. Composrte DNA ^9 
domains provide one means for achieving novel sequence specificity for the prot in-DNA 
binding interaction. An illustrative composite DNA binding domain containing component 
peptide sequences of human origin is ZFHD-1 which is described in detail below, 
.ndividual DNA-binding domains may be further modified by mutagenes IS to decrease, 
increase, or change the recognition specificity of DNA binding. These modifications can be 
achieved by rational design of substitutions in positions known to contnbute to DNA 
recognition (often based on homology to related proteins for which explicit structural data 

are available). . 

For example, in the case of a homeodomain. substitutions can be made in amino 
acids in the N-terminal arm. first loop, second helix, and third helix known to contact DNA. In 
zinc fingers, substitutions can be made at selected positions in the DNA recognition hel.x. 
Alternatively, random methods, such as selection from a phage display library can be used 
to identify altered domains with increased affinity or altered specificity. 

For additional examples, information and guidance on designing, mutating, selecting, 
combining and characterizing DNA binding domains, see. e.g.. Pomerantz Jl_ Wotfe SA, 
Pabo CO Structure-based design of a dimeric zinc finger protein Biochem.stry 1998 Jan 
27-37(4V965-970; Kim J-S and Pabo CO. Getting a Handhold on DNA: Design of Poly- 
Zinc Finger Proteins with Femtomolar Dissociation Constants. PNAS USA, 1998 Mar 
17-95(6V2812-2817; Kim JS, Pabo CO. Transcriptional repression by zinc finger 
peptides. Exploring the potential tor applications in gene therapy. . J Bio. Chem 1997 Nov 
2V272(47)-29795-29800; Greisman HA. Pabo CO . A general strategy tor selecting high- 
affinity zinc finger proteins for diverse DNA target sites. Science 1 997 Jan 
31 -275(5300):657-661 ; Rebar EJ. Greisman HA. Pabo CO. Phage display methods for 
selecting zinc finger proteins with novel DNA-binding specificities. Methods Enzvmol 
1996;267:129-149; Pomerantz Jl_ Pabo CO. Sharp PA . Analysis of homeodomain 
function by structure-based design of a transcription factor. Proc Natl Acad Sc. U S A 1995 
Oct 10:92(21 ):9752-9756; Rebar EJ. Pabo CO. Zinc finger phage: affinity selection of 
fingers with new DNA-binding specificities. Science 1994. Feb 4;263:671 -673; Choo Y. 
Sanches-Garcia I. Klug A. In vivo repression by a site-specific DNA-bind.ng protein 
designed against an oncogenic sequence. Nature 1994. Dec 15:372:642-645; Choo Y. 
Klug A. Toward a code for the interaction of zinc fingers with DNA: Selection of randomized 
fingers displayed on phage. PNAS USA. Nov 1994; 91:1 1 163-1 1 167; Wu H. Yang W-P. 
Barbas CF III. Building zinc fingers by selection: toward a therapeutic application PNAS 
USA January 1995; 92:344-348; Jamieson AC. Kim S-H. Wells JA. In Vitro selection of 
zinc fingers with altered DNA-binrf.ng specificity, Biochemistry 1994. 33:5689-5695. 
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international patent applications WO 96/20951 , WO 94/18317. WO 96/06166 and WO 
95/19431 ; and USSN 60/084819. 

Additional domains and linkers 

Additional domains may be included in the fusion proteins of this invention. 
For example, the fusion proteins may contain a nuclear localization sequence (NLS) 
which provides for the protein to be translocated to the nucleus. A NLS can be located at 
the N-terminus or the C-terminus of a fusion protein, or can be located between component 
portions of the fusion protein, so long as the function of fusion protein and its components 
is disrupted by presence of the NLS. Typically a nuclear localization sequence has a 
plurality of basic amino acids, referred to as a bipartite basic repeat (reviewed in Garcia- 
Bustos et at. (1991) Biochimica et Biophysics Acta 1071:83-101). One illustrative NLS is 
derived from the NLS of the SV40 large T antigen which is comprised of amino adds 
proline-iysine-lysine-lysine-arginine-lysine-valine (Kalderon et al. (1984) Cell 39:499-509). 
Another illustrative NLS is derived from a p53 protein. Wild-type p53 contains three C- 
terminal nuclear localization signals, comprising residues 316-325. 369-375 and 379-384 of 
P 53 (Shaulsky et al. (1990) Mol. Cell. Biol. 10:6565-6577). Other NLSs are descnbed by 
Shaulsky et al (1990) supra and Shaulsky et al. (1991) Oncogene 62056. 

To facilitate their detection and/or purification, the fusion proteins may contain 
peptide portions such as "histidine tags", a glutathicne-S-transferase doma.n or an 
"epitope tag" which can be recognized by an antibody. 

The intervening distance and relative orientation of the various component domains 
of the fusion proteins can be varied to optimize their production or performance. The design 
of the fusion proteins may include one or more/linkers", comprising pept.de sequence 
(which may be naturally^ccurring or not) separating individual component polypeptide 
sequences. Many examples of linker sequences, their occurrence in nature, their design 
and their use in fusion proteins are known. See e.g. Huston et al. (1988) PNAS 85:4879; 
U S. Patent No. 5,091.513; and Richardson et al. (1988) Science 240:1648-1652. 



30 Target gene constructs 

A target gene construct comprises a gene of interest operably linked to an 
expression control sequence which permits ligand-regulated expression of the gene. More 
specifically, such a construct typically comprises: (1 ) one or more copies of a DNA 
sequence recognized by a DNA binding domain of a fusion protein of the invent.on (or by 

35 a DNA binding protein like RXR which binds to a fusion protein of the mvention); (2) a 
promoter sequence consisting minimally of a TATA box and initiator sequence but 
optionally including other transcription factor binding sites; (3) sequence encoding the 
desired product, including sequences that promote the initiation and teimination of 
translation, if appropriate; (4) an optional sequence consisting of a splice donor splice 

40 acceptor, and intervening intron DNA; and (5) a sequence directing cleavage and 
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polyadenyfation of the resulting RNA transcript. Typically the construct contains a copy of 
the target gene to be expressed, operably linked to an expression control sequence 
comprising a minimal promoter and one or more copies of a DMA recognition sequence 
responsive to the transcription factor. 

(a) Target genes 

A wide variety of genes can be employed as the target gene, including genes that 
encode a therapeutic protein, antisense sequence or ribozyme of interest, or any other 
protein which is of therapeutic or scientific interest. The target gene (and there may be 
multiple target genes) can encode a gene product which provides a desired phenotype. It 
can encode a membrane^bound or membrane-spanning protein, a secreted protein, or a 
cytoplasmic protein. The proteins which are expressed, singly or in comb.nat.on. can 
involve homing, cytotoxicity, proliferation, differentiation, immune response, inflammatory 
response, clotting, thrombolysis, hormonal regulation, angiogenesis. etc. The polypept.de 
encoded by the target gene may be of naturally occurring or non-naturally occurrring 
peptide sequence. 

Various secreted products include hormones, such as insulin, human growth 
hormone glucagon, pituitary releasing factor, ACTH. melanotropin, relaxin. leptin.efc; 
growth factors, such as EGF. IGF-1 . TGF-alpha. -beta. PDGF. G-CSF. M-CSF. GM-CSF. 
FGF erythropoietin, thrombopoietin. megakaryocyte growth factors, nerve growth factors. 
etc ■ proteins which stimulate or inhibit angiogenesis such as angiostatin, endostatin and 
VEGF and variants thereof; interieukins. such as IL-1 to -15; TNF-alpha and -beta; and 
enzymes and other factors, such as tissue plasminogen activator, members of the 
complement cascade, perforins, superoxide dismutase; coagulation-related factors such as 
antthrombin-HI. Factor V. Factor VII. Factor Vlllc. vWF. Factor IX. alpha-anti-tryps.n. 
protein C. and protein S; endorphins, dynorphin. bone morphogenetic protein. CFTR. etc. 

The gene can encode a naturally-occurring surface membrane protein or a protein 
made so by introduction of an appropriate signal peptide and transmembrane sequence. 
Various such proteins include homing receptors, e.g. L-selectin (Mel-14). hematopoietic cell 
markers, e.g. CD3. CD4. CDS, B cell receptor. TCR subunrts alpha, beta, gamma or delta, 
CD10 CD19, CD28. CD33. CD38. CD41 . etc, receptors, such as the interieukin 
receptors IL-2R, IL-4R, etc.; receptors for other ligands including the various hormones, 
growth factors, etc.; receptor antagonists for such receptors and soluble forms of such 
receptors; channel proteins, for influx or efflux of ions. e.g. H+, Ca+2, k+. Na+. CI-. etc.. 
and the like; CFTR. tyrosine activation motif, zap-70. etc. 

Proteins may be modified for transport to a vesicle for exocytosis. By adding the 
sequence from a protein which is directed to vesicles, where the sequence is modified 
proximal to one or the other terminus, or situated in an analogous position to the protein 
source, the modified protein will be directed to the Golgi apparatus for packaging in a 
vesicle. This process in conjunction with the presence of the chimeric prote.ns for 
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exocytosis allows tor rapid transfer of the proteins to the extracellular medium and a 
relatively high localized concentration. 

The target gene product can be an intracellular protein such as a protein .nvolved m 
a metabolic pathway, or a regulatory protein, steroid receptor, transcription factor, ere 

By way of further illustration, in T-cells. one may wish to introduce genes encoding 
one or both chains of a T-cell receptor. For B-ce.ls. one could provide the heavy and ..ght 
chains for an immunoglobulin for secretion. For cutaneous cells, e.g. keratmocytes 
particularly keratinocyte stem cells . one could provide for protection agamst .nfecton by 
secreting alpha, beta or gamma interferon, antichemotactic factors, proteases speafic for 
bacterial cell wall proteins, various anti-viral proteins.efc. 

in various situations, one may wish to direct a cell to a particular site. The srte can 
include anatomical sites, such as lymph nodes, mucosal tissue, skin, synovium, lung or 
other internal organs or functional sites, such as clots, injured sites, sites of surg-cat 
manipulation, inflammation, infection, etc. Regulated expression of a membrane protem 
which recognizes or binds to the particular site of interest, for example, provides a method 
tor directing the engineered cells to that site. Thus one can achieve a locahzed 
concentration of a secreted product or effect cell-based healing, scavenging, protects from 
infection, anti-tumor activity, etc. Proteins of interest include homing receptors, e.g. L- 
selectin. GMP140. CLAM-1 . etc.. or addressins. e.g. ELAM-1.PNAd. LNAd. etc dot 
binding proteins, or ceil surface proteins that respond to localized gradients of chemotactic 
fsctors 

' in one embodiment, recognition elements for a DNA binding domain of one ol the 
subject fusion proteins are introduced into the host cells such that they are operatively 
linked to an endogenous target gene. e.g. by homologous recombination with generic 
DNA A variety of suitable approaches s are available. See. e.g.. PCT publ.cat.ons 
WO93/09222. WO95/31560. W096/2941 1 . WO95/31560 and WO94/12650. Th.s 
permits ligand -mediated regulation of the transcription of the endogenous gene. 

(b) Minimal Promoters. 

Minimal promoters which may be incorporated into a target gene construct (or other 
construct of the invention) may be selected from a wide variety of known s ^ uen ~ s ; 
including promoter regions from fos. hCMV. SV40 and IL-2. among many others. Illustrate 
examples are provided which use a minimal CMV promoter or a minimal IL2 gene promoter 
(-72 to +45 with respect to the start site; Siebenlist et al.. MCB 6:3042-3049. 1986) 

(c) DNA recognition sequences. 

The choice of recognition sequences to use in the target gene construct .s .n some 
cases determined by the nature of the regulatory system to be employed. 
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Where the target gene construct comprises an endogenous gene with its own 
regulatory DNA, the recognition sequence is thereby provided by the cells, and the 
practitioner provides a DNA binding domain which recognizes it. 

In systems relying on a tetR or RXR-type DNA binding domain, the recognition 
sequence is again ususally predetermined (by the choice of tetR or RXR-type DNA 
binding domain). 

In other cases, e.g.. in ligand-mediated crosslinking systems and systems like the 
progesterone receptor-based system, a diverse set of DNA binding domain:recogn,t,on 
sequence choices are available to the practitioner. 

Recognition sequences for a wide variety of DNA-binding domains are known. 
DNA recognition sequences for other DNA binding domains may be determined 
experimentally. In the case of a composite DNA binding domain. DNA recognition 
sequences can be determined experimentally, or the proteins can be mandated to d.rect 
their specificity toward a desired sequence. A desirable nucleic acid recognrt.cn sequence 
for a composite DNA binding domain consists of a nucleotide sequence spanning at .east 
ten. preferably eleven, more preferably twelve or more, and even more preferably some 
cases eighteen bases. The component binding portions (putative or demonstrated) vrttun 
the nucleotide sequence need not be fully contiguous; they may be interspersed wrth 
-spacer- base pairs that need not be directly contacted by the chimeric protein but rather 
impose proper spacing between the nucleic acid subsites recognized by each module. 
These sequences should not impart expression to linked genes when introduced into cells 
in the absence of the engineered DNA-binding protein. 

To identify a nucleotide sequence that is recognized by a chimenc protein 
containing a DNA-binding region, preferably recognized with high affinity (dissoc.at.on 
constant 10- 1 1 M or lower are especially preferred), several methods can be used. If h.gh- 
affmity binding sites for individual subdomains of a composite DNA-binding region are 
already known, then these sequences can be joined with various spacing andonen aton 
and the optimum configuration determined experimentally (see below for methods for 
determining affinities). Alternatively, high-affinity binding sites for the protein or protem 
complex can be selected from a large pool of random DNA sequences by adaptation of 
published methods (Pollock. R. and Treisman. R.. 1990. A sensitive 

determination of protein-DNA binding specificities. Nud. Acids Res. 18. 6197-6204). Bound 
sequences are cloned into a plasmid and their precise sequence and affinrty for the 
proteins are determined. From this collection of sequences, individual sequences wrth 
desirable characteristics (/.e.. maxima, affinity for composite protein, minima, affinrty for 

individual subdomains) are selected for use. Alternative*, the co.lect.on of sequences m 

used to derive a consensus sequence that carries the favored base pairs at each pos.tion. 

Such a consensus sequence is synthesized and tested to confirm that it has an 

appropriate level of affinity and specificity. 
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The target gene constructs may contain multiple copies of a DNA recognition 
sequence. For instance, the constructs may contain 5. 8. 10 or 12 r cognition sequences tor 
GAL4 or tor ZFHD1 . 

Design and assembly of the DNA constructs 

Constructs may be designed in accordance with the principles, illustrative 
examples and materials and methods disclosed in the patent documents and scientif.c 
literature cited herein, with modifications and further exemplification as descnbed. 
Components of the constructs can be prepared in conventional ways, where the coding 
sequences and regulatory regions may be isolated, as appropriate, ligated, cloned in an 
appropriate cloning host, analyzed by restriction or sequencing, or other convenient 
means Particularly, using PCR, individual fragments including all or portions of a functional 
unit may be isolated, where one or more mutations may be introduced using 'pnmer repair", 
ligation, in vitro mutagenesis, etc. as appropriate. In the case of DNA constructs encoding 
fusion proteins, DNA sequences encoding individual domains and sub-doma.ns are joined 
such that they constitute a single open reading frame encoding a fusion protein capable of 
being translated in cells or cell lysates into a single polypeptide harboring all component 
domains. The DNA construct encoding the fusion protein may then be placed into a vector 
for transducing host cells and permitting the expression of the protein. For biochemical 
analysis of the encoded chimera, it may be desirable to construct plasmids that d.rect the 
expression of the protein in bacteria or in reticulocyte-lysate systems. For use in the 
production of proteins in mammalian cells, the protein^ncoding sequence is introduced into 
an expression vector that directs expression in these cells. Expression vectors suitable for 
such uses are well known in the art Various sorts of such vectors are commercially 
available. 

Introduction of Constructs into Cells 

This invention is particularly useful for the engineering of animal cells and in 
applications involving the use of such engineered animal cells. The animal cells may be 
insect, worm or mammalian cells. While various mammalian cells may be used, including, by 
way of example, equine, bovine, ovine, canine, feline, murine, and non-human pnmate 
cells, human and mouse cells are of particular interest Across the various species, vanous 
types of cells may be used, such as hematopoietic, neural, glial, mesenchymal, cutaneous, 
mucosal, stromal, muscle (including smooth muscle cells), spleen, reticuloendothelial, 
epithelial, endothelial, hepatic, kidney, gastrointestinal, pulmonary, fibroblast, and other cell 
types. Of particular interest are muscle cells (including skeletal, cardiac and other muscle 
cells), cells of the central and peripheral nervous systems, and hematopoietic cells, wn.cn 
may include any of the nucleated cells which may be involved with the erythroid, 
lymphoid or myelomonocytic lineages, as well as myoblasts and fibroblasts. Also of 



37 



WO 99/10510 



PCT/US 98/ 17723 



interest are stem ancLprogenitor cells, such as hematopoietic, neural, stromal, muscle. 

hepatic, pulmonary, gastrointestinal and mesenchymal stem cells 

The cells may be autologous cells, syngeneic cells, allogeneic cells and even in 

some cases, xenogeneic cells with respect to an intended host organism. The cells may 

be modified by changing the major histocompatibility complex ('MHO profile, by 

inactivating B2-microglobulin to prevent the formation of functional Class I MHC molecules. 

inactivation of Class II molecules, providing lor expression of one or more MHC molecules 

enhancing or inactivating cytotoxic capabilities by enhancing or inhibiting the expression of 

genes associated with the cytotoxic activity, and the like. 

In some instances speafic clones or oligoclonal cells may be of interest, where the 

cells have a particular specificity, such as T cells and B cells having a specific antigen 

specificity or homing target site specificity. 

Constructs encoding the fusion proteins and comprising target genes of this 
invention can be introduced into the cells as one or more nudeic acid molecules or 
constructs, in many cases in association with one or more markers to allow for select.on of 
host cells which contain the construct(s). The constructs can be prepared in conventional 
ways where the coding sequences and regulatory regions may be isolated, as 
appropriate, ligated, cloned in an appropriate cloning host, analyzed by restnctjon or 
sequencing, or other convenient means. Particularly, using PCR. individual fragments 
including all or portions of a functional domain may be isolated, where one or more 
mutations may be introduced using -primer repair*, ligation, in vitro mutagenesis, etc. as 
appropriate. 

The construct(s) once completed and demonstrated to have the appropnate 
sequences may then be introduced into a host cell by any convenient means. The 
constructs may be incorporated into vectors capable of episomal replication (e.g. BPV or 
EBV vectors) or into vectors designed for integration into the host cells' chromosomes. The 
constructs may be integrated and packaged into non-replicating, defective viral genomes 
like Adenovirus. Adeno-associated virus (AAV), or Herpes simplex virus (HSV) or others, 
including retroviral vectors, for infection or transduction into cells. Alternatively, the 
construct may be introduced by protoplast fusion, etectroporation. biolistics, calcium 
phosphate transection, lipofection. microinjection of DNA or the like. The host cells w.ll .n 
some cases be grown and expanded in culture before introduction of the constructs), 
followed by the appropriate treatment for introduction of the construct(s) and integration of 
the construct(s). The cells will then be expanded and screened by virtue of a marker 
present in the constructs. Various markers which may be used successfully include hprt, 
neomycin resistance, thymidine kinase, hygromycin resistance, etc.. and vanous cell- 
surface markers such as Tac, CD8. CD3. Thy1 and the NGF receptor. 

In some instances, one may have a target site tor homologous recombinat.on. 
where H is desired that a constiuct be integrated at a particular locus. For example, one can 
delete and/or replace an endogenous gene (at the same locus or elsewhere) with a 
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recombinant target construct of this invention. For homologous r ^™*^^* 
oenerally use either a or O-vectors. See, tor example, Thomas and Capecchi. df (1987) 
51. 503-512; Mansour, eta/.. Nature (1988) 336. 348-352; and Joyner. eta/.. Nature 

(1989) 338. 1 53-156. 

The constructs may be introduced as a single DNA molecule encoding all of the 
genes or different DNA molecules having one or more genes. The constructs may be 
introduced simultaneously or consecutively, each with the same or different markers. 

Vectors containing useful elements such as bacterial or yeast ongms of repl.cat.on. 
selectable and/or amplifiable markers, promoter/enhancer elements for expression in 
prokaryotes or eukaryotes. and mammaiian expression contro. elements, etc. wh.ch may 
be used to prepare stocks of construct DNAs and for carrying out transfers are well 
known in the art. and many are commercially available. 

Introduction of Constructs into Animals 

Any means for the introduction of genetically engineered cells or heterologous DNA 
into animals, preferably mammals, human or non-human, may be adapted to the practice of 
this invention for the delivery of the various DNA constructs into the intended 
the purpose of this discussion, the various DNA constructs descnbed herem may together 
be referred to as the transgene. 

bY ^ ^^ZZ^~^ ex vivo or in vitro with the DNA constructs may 
be grown in culture under selective conditions and cells which are selected as 
desired construct(s) may then be expanded and further analyzed, us.ng. for example the 
polymerase chain reaction for determining the presence of the construct in the host oeHs 
and/or assays for the production of the desired gene produces). After be.ng transduced 
with the heterologous genetic constructs, the modified host cells may be identrf,ed. 
selected, gorwon. characterized, etc. as desired, and then may be used as planned, e.g. 
grown in culture or introduced into a host organism. 

Depend.ng upon the nature of the cells, the cells may be introduced mto a host 
organism, e.g. a mammal, in a wide variety of ways, generally by injection or implantation 
into the desired tissue or compartment, or a tissue or compartment permitting m.grauon of 
the cells to their intended destination. Illustrative sites for injection or implantation 
vascular system, bone marrow, muscle, liver, cranium or spina, cord, peritoneum, and Stan. 
5 Hematopoietic cells, for example, may be administered by injection into the vascular 
system, there being usually at least about 10* cells and generally not more than about 
1010 cells. The number of cells which are employed will depend upon the circumstances, 
the purpose for the introduction, the lifetime of the cells, the protocol to be used, for 
example, the number of administrations, the ability of the cells to multiply, the stab.lrty of 
,o the therapeutic agent, the physiologic need for the therapeutic agent, and the like. 
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Generally, for myoblasts or fibroblasts for example, the number of cells will be at least 
about 104 and not more than about 109 and may be applied as a dispersion, generally 
being injected at or near the site of interest. The cells will usually be in a physiologically- 
acceptable medium. 

Cells engineered in accordance with this invention may also be encapsulated, e.g. 
using conventional biocompatible materials and methods, prior to implantation into the host 
organism or patient for the production of a therapeutic protein. See e.g. Hguyen et al. 
Tissue implant Systems and Methods for Sustaining viable High Cell Densities with.n a 
Host US Patent No. 5.314,471 (Baxter International. Inc.); Uludag and Sefton. 1993, J 
Biomed Mater. Res. 27(10):l213-24 (HepG2 cells/hydroxyethyt methacrylate-methyl 
methaaylate membranes); Chang etal. 1 993. Hum Gene Ther 4(4);433^0 (mouse Ltk- 
cells expressing hGW.mmunoprotective perm-selective alginate microcapsules; Reddy et 
al 1993. J infect Dis 168(4):1082-3 (alginate); Tai and Sun. 1993. FASEB J 7(1 1):1061-9 
(mouse fibroblasts expressing hGH/alginate-poly-L-lysine-alginate membrane); Ao et al. 
1995 Transplantation Proc. 27(6):3349. 3350 (alginate); Rajotte et al. 1995. 
Transplantation Proc. 27(6):3389 (alginate); Lakey et al. 1995. Transplantation Proc. 
27(6)-3266 (alginate); Korbutt et al. 1995. Transplantation Proc. 27(6):3212 (algmate); 
Dorian et al. US Patent No. 5.429.821 (alginate); Emerich et al. 1993. Exp Neurol 
122(1) 37-47 (polymer-encapsulated PC12 cells); Sagen et al. 1993. J Neurosci 
13(6)5415-23 (bovine chromaffin cells encapsulated in semipermeable polymer membrane 
and implanted into rat spinal subarachnoid space); Aebischer et al, 1994. Exp Neurol 
126(2):151-8 (polymer-encapsulated rat PC12 cells implanted into monkeys; see also 
Aebischer WO 92/19595); Savelkoul etal. 1994, J Immunol Methods 170(2):185-96 
(encapsulated hybridomas producing antibodies; encapsulated transfected cell l.nes 
expressing various cytokines); Winn et al. 1994. PNAS USA 91(6)2324-8 (engineered 
BHK cells expressing human nerve growth factor encapsulated in an immunoisoiation 
polymeric device and transplanted into rats); Emerich et al. 1994, Prog 
Neuropsychopharmacol Biol Psychiatry 18(5):93EM6 (polymer-encapsulated PC12 cells 
implanted into rats); Kordower et al. 1994. PNAS USA 91 (23): 10898-902 (polymer- 
encapsulated engineered BHK celts expressing hNGF implanted into monkeys) and Butler 
et al WO 95/04521 (encapsulated device). The cells may then be introduced in 
encapsulated form into an animal host, preferably a mammal and more preferably a human 
subject in need thereof. Preferably the encapsulating material is semipermeable, permuting 
release into the host of secreted proteins produced by the encapsulated cells. In many 
embodiments the semipermeable encapsulation renders the encapsulated cells 
immunologically isolated from the host organism in which the encapsulated cells are 
introduced. In those embodiments the cells to be encapsulated may express one or more 
fusion proteins containing component domains derived from proteins of the host spec.es 
and/or from viral proteins or proteins from species other than the host spec.es. The cells 
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may be derived from one or more individuals other than the recipient and may be derived 
from a species other than that of the recipient organism or patient. 

by in vivo genetic engineering 

Instead of ex vivo modification of the ceils, in many situations one may wish to 
modify cells in vivo. A variety of techniques have been developed for genetic eng.neer.ng 
of target tissue and cells in vivo, including viral and non-viral systems. 

In one approach, the DNA constructs are delivered to cells by transfect.on. i.e., by 
delivery to cells of 'naked DNA", lipid-complexed or liposome-formulated DNA. or otherw.se 
formulated DNA. Prior to formulation of DNA. e.g.. with lipid, or as in other approaches, pnor 
to incorporation in a final expression vector, a plasmid containing a transgene beanng the 
desired DNA constructs may first be experimentally optimized for express.on (e.g.. 
inclusion of an intron in the 5' untranslated region and elimination of unnecessary 
sequences (Feigner, et al.. Ann NY Acad Sci 126-139. 1995). Formulation of DNA. e.g. 
with various lipid or liposome materials, may then be effected using known methods and 
materials and delivered to the recipient mammal. See. e.g.. Canonico et al. Am J Resp.r 
Cell Mol Biol 1024-29. 1994 (in vivo transfer of an aerosolized recombinant human atphal- 
antitrypsin gene complexed to cationic liposomes to the lungs of rabbits); Tsan et al. Am J 
Physiol 268 (Lung Cell Mol Physiol 12): L1052-L1056. 1995 (transfer of genes to rat lungs 
via tracheal insufflation of plasmid DNA alone or complexed with cationic liposomes); Alton 
et at. Nat Genet. 5:135-142. 1993 (gene transfer to mouse airways by nebulized delivery 
of cDN A-liposome complexes). In either case, delivery of vectors or naked or formulated 
DNA can be carried out by instillation via bronchoscopy, after transfer of viral parties to 
Ringer's, phosphate buffered saline, or other similar vehicle, or by nebul.zat.on. 

Viral systems include those based on viruses such as adenovirus, adeno- 
associated virus, hybrid adeno-AAV. lentivirus and retroviruses, which allow for 
transduction by infection, and in some cases, integration of the virus or transgene into the 
host genome. See. for example. Dubensky et al. (1984) Proc. Natl. Acad. Sc.. USA 81 , 
7529-7533; Kaneda et al.. (1989) Science 243.375-378; Hiebert et al. (1989) Proc NatL 
Acad Sci USA 86. 3594-3598; Hatzoglu et al. (1990) J. Biol. Chem. 265. 17285-17293 
and Ferry, et al. (1 991 ) Proc. Natl. Acad. Sci. USA 88. 8377-8381 . The virus may be 
administered by injection (e.g. intravascularly or intramuscularly), inhalation, or other 
parenteral mode. Non-viral delivery methods such as administration of the DNA v.a 
complexes with liposomes or by injection, catheter or biolistics may also be used. See e.g. 
WO 96/41 865. PCTAJS97/22454 and USSN 60/08481 9. for example, for additional 
guidance on formulation and delivery of recombinant nucleic acids to cells and to organ.sms. 

By employing an attenuated or modified retrovirus carrying a target transcnpt.onal 
initiation region, if desired, one can activate the virus using one of the subject transcnpfon 
factor constructs, so that the virus may be produced and transduce adjacent cells. 
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The use of recombinant viruses to deliver the nucleic acid constructs are of 
particular interest. The transgene(s) may be incorporated into any of a van ty of viruses 

useful in gene therapy. 

In clinical settings, the gene delivery systems (i.e.. the recomb,nant nucie.c adds in 
vectors virus, lipid formulation or other form) can be introduced into a patient, e.g.. by any 
of a number of known methods. For instance, a pharmaceutical preparation of the gene 
delivery system can be introduced systemically. e.g. by intravenous injection, inhalation, 
etc. In some systems, the means of delivery provides tor specific or selective transduction 
of the construct into desired target cells. This can be achieved by regional or local 
administration (see U.S. Patent 5.328,470) or by stereotactic injection, e.g. Chen et aL. 
(1 994) PNAS USA 91 : 3054-3057 or by determinants of the delivery means. For instance, 
some viral systems have a tissue or cell-type specificity for infection. In some systems 
cell-type or tissue-type expression is achieved by the use of cell-type or tissue-specific 
expression control elements controlling expression of the gene. 

Those references as well as the references cited previously, including those 
relating to tetR-based systems, progesterone-receptor-based systems and ecdysone- 
based systems, provide detailed additional guidance on the preparation, formulation and 
delivery of various ligands to cells in vitro and to organisms. 

In preferred embodiments of the invention, the subject expression constructs are 
derived by incorporation of the genetic construct(s) of interest into viral delivery systems 
including a recombinant retrovirus, adenovirus, adeno-asscciated virus (AAV), hybnd 
adenovirus/AAV, herpes virus or lentivirus (although other applications may be earned out 
using recombinant bacterial or eukaryotic plasmids). While various viral vectors may be 
used in the practice of this invention. AAV- and adenovirus-based approaches are of 
particular interest for the transfer of exogenous genes in vivo, particularly into humans and 
other mammals. The following additional guidance on the choice and use of viral vectors 
may be helpful to the practitioner, especially with respect to applications involving whole 
animals (including both human gene therapy and the development and use of animal model 
systems), whether ex vivo or in vivo. 



Viral Vectors: 



Adenoviral vectors 

A viral gene delivery system useful in the present invention utilizes adenovirus- 
derived vectors. Knowledge of the genetic organization of adenovirus, a 36 kB. linear and 
double-stranded DNA virus, allows substitution of a large piece of adenoviral DNA with 
foreign sequences up to 8 kB. In contrast to retrovirus, the infection of adenoviral DNA into 
host cells does not result in chromosomal integration because adenoviral DNA can replicate 
in an episomal manner without potential genotoxicity. Also, adenoviruses are structurally 
stable, and no genome rearrangement has been detected after extensive amplif.cat.on. 



42 



WO 99/10510 



PCT/US98/17723 



10 



15 



20 



25 



30 



35 



Adenovirus can infect virtually all epithelial cells regardless of their cell cycle stage. So far, 
adenoviral infection appears to be linked only to mild dis ase such as acute respiratory 

disease in the human. 

Adenovirus is particularly suitable for use as a gene transfer vector because of its 
mid-sized genome, ease of manipulation, high titer, wide target-cell range, and high 
inf activity. Both ends of the viral genome contain 100-200 base pair (bp) inverted terminal 
repeats (ITR). which are cis elements necessary for viral DNA replication and packaging. 
The early (E) and late (L) regions of the genome contain different transcription domains that 
are divided by the onset of viral DNA replication. The E1 region (E1 A and E1B) encodes 
proteins responsible for the regulation of transcription of the viral genome and a few cellular 
genes. The expression of the E2 region (E2A and E2B) results in the synthesis of the 
proteins for viral DNA replication. These proteins are involved in DNA replication, late gene 
expression, and host cell shut off (Renan (1990) Radiotherap. Oncol. 19:197). The 
products of the late genes, including the majority of the viral capsid proteins, are expressed 
only after significant processing of a single primary transcript issued by the major late 
promoter (MLP). The MLP (located at 16.8 m.u.) is particularly efficient during the late 
phase of infection, and all the mRNAs issued from this promoter possess a 5* tripartite 
leader (TL) sequence which makes them preferred mRNAs for translation. 

The genome of an adenovirus can be manipulated such that it encodes a gene 
product of interest, but is inactivated in terms of its ability to replicate in a normal lytic viral 
life cycle (see. for example, Berkner et al.. (1988) BioTechniques 6:616; Rosenfeld et al.. 
(1991) Science 252:431-434; and Rosenfeld et al.. (1992) Cell 68:143-155). Suitable 
adenoviral vectors derived from the adenovirus strain Ad type 5 d!324 or other strains of 
adenovirus (e.g.. Ad2. Ad3. Ad7 etc.) are well known to those skilled in the art. 
Recombinant adenoviruses can be advantageous in certain circumstances in that they are 
not capable of infecting nondividing cells and can be used to infect a wide variety of cell 
types, including airway epithelium (Rosenfeld et al.. (1992) cited supra), endothelial cells 
(Lemarchand et al.. (1992) PNAS USA 89:6482-6486). hepatocytes (Hera and Gerard. 
(1993) PNAS USA 90:2812-2816) and muscle cells (Quantin et al.. (1992) PNAS USA 
892581-2584). Adenovirus vectors have also been used in vaccine development 
(Grunhaus and Horwitz (1992) Seminar in Virology 3:237; Graham and Prevec (1992) 
Biotechnology 20:363). Experiments in administering recombinant adenovirus to different 
tissues include trachea instillation (Rosenfeld et al. (1991) ; Rosenfeld et al. (1992) Cell 
68:143). muscle injection (Ragot et al. (1993) Nature 361:647). peripheral intravenous 
injection (Herz and Gerard (1 993) Proc. Natl. Acad. Sci. U.S.A. 90:2812). and stereotactic 
inoculation into the brain (Le Gal La Salle et al. (1993) Science 254:988). 

Furthermore, the virus particle is relatively stable and amenable to purification and 
concentration, and as above, can be modified so as to affect the spectrum of infectivity. 
Additionally, adenovirus is easy to grow and manipulate and exhibits broad host range ,n 
vitro and in vivo. This group of viruses can be obtained in high titers, e.g.. 109 - 101 
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plaque-forming unit (PFU)/ml. and they are highly infective. The life cycle of adenovirus 
does not require integration into the host cell genome. The foreign genes delivered by 
adenovirus vectors are episomal. and therefore, have low genotoxicity to host cells. No 
side effects have been reported in studies of vaccination with wild-type adenovirus 
(Couch et al.. 1963; Top et al.. 1971), demonstrating their safety and therapeutic potential 
as in vivo gene transfer vectors. Moreover, the carrying capacity of the adenoviral 
qenome for foreign DN A is large (up to 8 kilobases) relative to other gene delivery vectors 
(Berkner et al.. supra; Haj-Ahmand and Graham (1 986) J. Virol. 57:267). Most replication- 
defective adenoviral vectors currently in use and therefore favored by the present 
invention are deleted for all or parts of the viral E1 and E3 genes but retain as much as 
80% of the adenoviral genetic material (see. e.g.. Jones et al.. (1 979) Cell 16:683; Berkner 
et al supra- and Graham et al.. in Methods in Molecular Biology. E.J. Murray, Ed. 
(Humana, Clifton. NJ. 1991) vol. 7. pp. 109-127). Expression of the inserted gene can be 
under control of. tor example, the E1 A promoter, the major late promoter (MLP) and 
associated leader sequences, the viral E3 promoter, or exogenously added promoter 
sequences. 

Other than the requirement that the adenovirus vector be replication defective, or 
at least conditionally defective, the nature of the adenovirus vector is not believed to be 
crucial to the successful practice of the invention. The adenovirus may be of any of the 42 
different known serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the 
preferred starting material in order to obtain the conditional replication^lefective adenovirus 
vector tor use in the method of the present invention. This is because Adenov.rus type 5 
is a human adenovirus about which a great deal of biochemical and genetic information .s 
known, and it has historically been used for most constructions employing adenov.rus as 
a vector As stated above, the typical vector according to the present invention is 
replication defective and will not have an adenovirus E1 region. Thus, it will be most 
convenient to introduce the nucleic acid of interest at the position from which the E1 coding 
sequences have been removed. However, the position of insertion of the nucle.c acid of 
interest in a region within the adenovirus sequences is not critical to the present mvent.cn. 
For example, the nucleic acid of interest may also be inserted in lieu of the deleted E3 
region in E3 replacement vectors as described previously by Karisson eL al. (1986) or in 
the E4 region where a helper cell line or helper virus complements the E4 defect. 

A preferred helper cell line is 293 (ATCC Accession No. CRL1573). This helper 
cell line also termed a "packaging cell line" was developed by Frank Graham (Graham et 
al (1987) J Gen. Virol. 36:59-72 and Graham (1977) J.General Virology 68:937-940) and 
provides E1 A and E1B in trans. However, helper cell lines may also be derived from 
human cells such as human embryonic kidney cells, muscle cells, hematopoietic cells or 
other human embryonic mesenchymal or epithelial cells. Alternatively, the helper cells may 
be derived from the cells of other mammalian species that are permissive for human 
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adenovirus. Such cells include, e.g.. Vero cells or other monkey embryonic mesenchymal 

° f 6Pith Vano^adenovirus vectors have been shown to be of use in the transfer of genes 
to mammals, including humans. Replication-deficient adenovirus vectors have been used 
to express marker proteins and CFTR in the pulmonary epithelium. Because of the.r ab.hty 
to efficiently infect dividing cells, their tropism for the lung, and the relative ease of 
generation of high titer stocks, adenoviral vectors have been the subject of much .research 
fn the last few years, and various vectors have been used to deliver genes to t e lungs o, 
human subjects (Zabner et al.. Cel. 75507-216. 1993; Crystal, et a... Nat Genet. 8^42-51 . 
1994- Boucher et a... Hum Gene Ther 5:615-639. 1994). The first generate E1a deleted 
adenovirus vectors have been improved upon with a second generation that includes a 
tempeLure-sensitive E2a viral protein, designed to express less viral prote.n and thereby 
make the virally infected cell less of a target for the immune system (Goldman et al.. Human 
Gene Therapy 6:839-851 .1 995). More recently, a viral vector deleted of all viral open 
reading frames has been reporied (Rsher et a... Virology 217:1 1-22. 1996). Morec^ ,t 
has been shown that expression of viral IL-10 inhibits the immune response to adenovra. 
antiaen (Qin et al., Human Gene Therapy 8:1 365-1 374, 1 997). 

Adenoviruses can also be cel. type specific, i.e.. infect only restricted types of ceUs 
and/or express a transgene only in restricted types of cells. For example, the viruses 
comprise a gene under the transcriptional control of a transcription ihmaton regK>n 
specifically regulated by target host cells, as described e.g.. in U.S. Patent No ^698.443. 
by Henderson and Schuur. issued December 16. 1997. Thus, repl.cat.on competent 
adenoviruses can be restricted to certain cells by. e.g., inserting a cell specie response 
element to regulate a synthesis of a protein necessary for replication e.g.. E1A or E1B. 

DNA sequences of a number of adenovirus types are available from Genbank. For 
example, human adenovirus type 5 has GenBank Accession NO.M73260. The 
adenovirus DNA sequences may be obtained from any of the 42 human adenov,rus types 
currently identified. Various adenovirus strains are available from the Amencan Type 
Culture Collection. Rockville. Maryland, or by request from a number of commercial and 
academic sources. A transgene as described herein may be incorporated into any 
adenoviral vector and delivery protocol, by the same methods (restriction digest, linker 
ligation or filling in of ends, and ligation) used to insert the CFTR or other genes .nto the 

VeCt ° rS Adenovirus producer cell lines can include one or more of the adenoviral genes E1 . 
E2a. and E4 DNA sequence, for packaging adenovirus vectors in which >on .or ^more 
these genes have been mutated or deleted are desenbed. e.g.. .n PCT/US95/1 5947 (WO 
96/18416) by Kadan et al.; PCTAJS95/07341 (WO 95/346671) by Kovesdi et aL. 
PCT/FR94/00624 (W094/28152) by Imler et a.,PCT/FR94/00851 (WO 95/02697) by 
Perrocaudet et al.. PCT/US95/1 4793 (WO96/14061 ) by Wang et al. 
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AAV Vectors , . 

Another viral vector system useful for delivery of DNA is the adeno-assocated 
virus (AAV). Adeno-associated virus is a naturally occurring defective virus that requ.res 
another virus, such as an adenovirus or a herpes virus, as a helper virus for etfiaent 
replication and a productive life cycle. (For a review, see Muzyczka et al.. Curr. Top.cs ,n 
Micro, and Immunol. (1992) 158:97-129). 

AAV has not been associated with the cause of any disease. AAV is not a 
transforming or oncogenic virus. AAV integration into chromosomes of human cell lines does 
not cause any significant alteration in the growth properties or morphological charactenst.cs 
of the cells, These properties of AAV also recommend it as a potentially useful human gene 
therapy vector. 

AAV is also one of the few viruses that may integrate its DNA into non-dividing 
cells e g pulmonary epithelial cells or muscle cells, and exhibits a high frequency of stable 
integration (see for example Flotte et al.. (1992) Am. J. Respir. Cell. Mol. Biol. 7:349-356; 
Samulski et at., (1 989) J. Virol. 63:3822-3828; and McLaughlin et al.. (1 989) J. Virol. 
62-1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged 
and can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector 
such as that described in Tratschin et al.. (1985) Mol. Cell. Biol. 5:3251-3260 can be used 
to introduce DNA into cells. A variety of nucleic acids have been introduced into different 
cell types using AAV vectors (see for example Hermonat et al., (1984) PNAS USA 
81 -6466-6470; Tratschin et al.. (1985) Mol. Cell. Biol. 42072-2081 ; Wondisford et al.. 
(1988) Mol. Endocrinol. 2:32-39; Tratschin et al.. (1984) J. Virol. 51:611-619; and Flotte et 
al.. (1993) J. Biol. Chem. 268:3781-3790). 

The AAV-based expression vector to be used typically includes the 145 nucleot.de 
AAV inverted terminal repeats (ITRs) flanking a restriction site that can be used for 
suctioning of the transgene. either directly using the restriction site available, or by 
excision of the transgene with restriction enzymes followed by blunting of the ends, ligation 
of appropriate DNA linkers, restriction digestion, and ligation into the site between the ITRs. 
The capacity of AAV vectors is about 4.4 kb. The following proteins have been 
expressed using various AAV-based vectors, and a variety of promoter/enhancers: 
neomycin phosphotransferase, chloramphenicol acetyl transferase. Fanconi's anemia gene, 
cystic fibrosis transmembrane conductance regulator, and granulocyte macrophage colony- 
stimulating factor (Kotin. R.M.. Human Gene Therapy 5:793-801. 1994. Table I). A 
transgene incorporating the various DNA constructs of this invention can similarly be 
included in an AAV-based vector. As an alternative to inclusion of a constitutive promoter 
such as CMV to drive expression of the recombinant DNA encoding the fus.on prote.n(s). 
e g fusion proteins comprising an activation domain or DNA-binding domain, an AAV 
promoter can be used (ITR itself or AAV P 5 (Flotte. et al. J. Biol.Chem. 268:3781-3790. 
1993)). 
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Such a vector can be packaged into AAV virions by reported methods. For 
example a human cell line such as 293 can be co-transfected with the AAV-based 
expression vector and another plasmid containing open reading frames encoding AAV rep 
and cap (which are obligatory for replication and packaging of the recomb.nant viral 
construct) under the control of endogenous AAV promoters or a heterologous promoter. In 
the absence of helper virus, the rep proteins Rep68 and Rep78 prevent accumulation of 
the replicative form, but upon superinfection with adenovirus or herpes virus, these 
proteins permit replication from the ITRs (present only in the construct conta.n.ng the 
transgene) and expression of the viral capsid proteins. This system results in packag.ng 
of the transgene DNA into AAV virions (Carter. B.J.. Current Opinion in Biotechnology 
3-533-539 1992" Kotin. B.M. Human Gene Therapy 5:793-801 . 1994)). Typically, three 
days after 'transaction, recombinant AAV is harvested from the cells along with adenov,rus 
and the contaminating adenovirus is then inactivated by heat treatment 

Methods to improve the titer of AAV can also be used to express the transgene in 
an AAV virion. Such strategies include, but are not limited to: stable expression of the 
ITR-flanked transgene in a ceil line followed by transfection with a second plasmid to d.rect 
viral packaging; use of a cell line that expresses AAV proteins inducibly. such as 
temperature-sensitive inducible expression or pharmacologically inducible express.on. 
Alternatively a cell can be transformed with a first AAV vector including a 5' FTR, a 3' ITR 
flanking a heterologous gene, and a second AAV vector which includes an inducible ong.n 
of replication, e.g.. SV40 origin of replication, which is capable of being induced 
agent such as the SV40 T antigen and which includes DNA sequences encodmg the AAV 
rep and cap proteins. Upon induction by an agent, the second AAV vector may replicate 
to a high copy number, and thereby increased numbers of infectious AAV particles may be 
generated (see. e.g. U.S. Patent No. 5.693.531 by Chiorini et a... issued December 2. 
1997 In yet another method for producing large amounts of recombinant AAV. a plasmid is 
used which incorporate the Epstein Barr Nuclear Antigen (EBNA) gene . the latent ong.n of 
replication of Epstein Barr virus (oriP) and an AAV genome. These plasm.ds are 
maintained as a multicopy extra-chromosomal elements in cells, such as in 293 cells. Upon 
addition of wild-type helper functions, these cells will produce high amounts of recombinant 
AAV (U S Patent 5.691 .176 by Lebkowski et al.. issued Nov. 25. 1997). In another 
system, an AAV packaging plasmid is provided that allows expression of the rep gene, 
wherein the p5 promoter, which normally controls rep expression, is replaced wrth a 
heterologous promoter (U.S. Patent 5.658.776. by Flotte et al.. issued Aug. 19. 1997). 
Additionally, one may increase the efficiency of AAV transduction by treating the cells w^th 
an agent that facilitates the conversion of the single stranded form to the double stranded 
form, as described in Wilson et al.. WO96/39530. 

AAV stocks can be produced as described in Hermonat and Muzyczka (1984) 
PNAS 81 :6466. modified by using the pAAV/Ad described by Samulski et al. (1989) J. 
Virol 63:3822. Concentration and purification of the virus can be achieved by reported 
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methods such as banding in cesium chloride gradients, as was used for the initial report of 
AAV vector expression in vivo (Flotte. et al. J.Biol. Chem. 268:3781-3790, 1993) or 
chromatographic purification, as described in O-Riordan et al., WO97/08298. 

Methods for in vitro packaging AAV vectors are also available and have the 
advantage that there is no size limitation of the DNA packaged into the particles (see, U.S. 
Patent No. 5.688.676. by Zhou et al., issued Nov. 18, 1997). This procedure involves the 
preparation of cell free packaging extracts. 

For additional detailed guidance on AAV technology which may be useful in the 
practice of the subject invention, including methods and materials for the incorporation of a 
transgene, the propagation and purification of the recombinant AAV vector conta.ning the 
transgene. and its use in transfecting cells and mammals, see e.g. Carter et al. US Patent 
No 4 797.368 (10 Jan 1989); Muzyczka et al. US Patent No. 5.139.941 (18 Aug 1992); 
Lebkowski et al. US Patent No. 5.173.414 (22 Dec 1992); Srivastava. US Patent No. 
5,252,479 (12 Oct 1993); Lebkowski et al. US Patent No. 5,354,678 (11 Oct 1994); Shenk 
et al US Patent No. 5.436,145(25 July 1995); Chatteri.ee et at. US Patent No. 5,454.935 
(12 Dec 1995). Carter et al WO 93/24641 (published 9 Dec 1993). and Natsoul.s. U.S. 
Patent No 5 622.856 (April 22, 1997). Further information regarding AAVs and the 
adenovirus or herpes helper functions required can be lound in the following articles.Berns 
and Bohensky (1987), "Adeno-Associated Viruses: An Update', Advanced in Virus 
Research, Academic 

Press, 33:243-306. The genome of AAV is described in Laughlin et al. (1983) "Cloning of 
infectious adeno-associated virus genomes in bacterial plasmids", Gene, 23: 65-73. 
Expression of AAV is described in Beaton et al. (1989) "Expression from the 
Adeno-associated virus P 5 and P 19 promoters is negatively regulated in trans by the rep 
protein" J. Virol., 63:4450-4454. Construction of rAAV is described in a number of 
publications: Tratschin et al. (1984) -Adeno-associated virus vector for high frequency 
integration, expression and rescue of genes in mammalian cells". Mol. Cell. Biol.. 
4 2072-2081 ; Hermonat and Muzyczka (1984) "Use of adeno-associated virus as a 
mammalian DNA doning vector. Transduction of neomycin resistance into mammalian tssue 
culture cells*. Proc. Natl. Acad. Sci. USA, 81:6466-6470; McLaughlin etal. (1988) 
■Adeno-associated virus general transduction vectors: Analysis of Proviral Structures'. J. 
Virol.. 62:1963-1973; and Samulski et al. (1989) 'Helper-free stocks of recombinant 
adeno-associated viruses: normal integration do=qooe viral gene expression'. J. Virol.. 
63 3822-3828. Cell lines that can be transformed by rAAV are those descnbed in 
Lebkowski et al. (1 988) 'Adeno-associated virus: a vector system for efficient introduction 
and integration of DNA into a variety of mammalian cell types", Mol. Cell. Biol., 
8 3988-3996. 'Producer' or "packaging" cell lines used in manufacturing recombinant 
retroviruses are described in Dougherty et al. (1989) J. Virol.. 63:3209-3212; and 
Markowitz etal. (1988) J. Virol., 62:1120-1124. 
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Hybrid Adenovirus-AAV Vectors 

Hybrid Adenovirus-AAV vectors represented by an adenovirus caps.d containing a 
nucleic acid comprising a portion of an adenovirus, and 5' and 3' ITR sequences from an 
AAV which flank a selected transgene under the control of a promoter. See e.g. Wilson et 
al International Patent Application Publication No. WO 96/13598. This hybrid vector .s 
characterized by high titer transgene delivery to a host celt and the ability to stably 
integrate the transgene into the host cell chromosome in the presence of the rep gene. This 
virus is capable of infecting virtually all cell types (conferred by its adenovirus sequences) 
and stable long term transgene integration into the host cell genome (conferred by its AAV 
sequences). 

The adenovirus nucleic acid sequences employed in the this vector can range from 
a minimum sequence amount, which requires the use of a helper virus to produce the 
hybrid virus particle, to only selected deletions of adenovirus genes, which deleted gene 
products can be supplied in the hybrid viral process by a packaging cell. For example, a 
hybrid virus can comprise the 5' and 3' inverted terminal repeat (ITR) sequences of an 
adenovirus (which function as origins of replication). The left terminal sequence (5') 
sequence of the Ad5 genome that can be used spans bp 1 to about 360 of the 
conventional adenovirus genome (also referred to as map units 0-1 ) and includes the 5 
ITR and the packaging/enhancer domain. The 3' adenovirus sequences of the hybrid virus 
2 o include the right terminal 3' ITR sequence which is about 580 nucleotides (about bp 
35 353- end of the adenovirus, referred to as about map units 98.4-100. 

The AAV sequences useful in the hybrid vector are viral sequences from which the 
rep and cap polypeptide encoding sequences are deleted and are usually the cis acting 5 
and 3' ITR sequences. Thus, the AAV ITR sequences are flanked by the selected 
25 adenovirus sequences and the AAV ITR sequences themselves flank a selected 

transgene. The preparation of the hybrid vector is further described in detail in published 
PCT application entitled "Hybrid Adenovirus-AAV Virus and Method of Use Thereof, WO 
96/13598 by Wilson et al. 

For additional detailed guidance on adenovirus and hybrid adenovirus-AAV 
30 technology which may be useful in the practice of the subject invention, including methods 
and materials for the incorporation of a transgene. the propagation and purification of 
recombinant virus containing the transgene, and its use in transfecting cells and mammals, 
see also Wilson et al. WO 94/28938. WO 96/13597 and WO 96/26285. and references 
cited therein. 

36 

Retroviruses 

The retroviruses are a group of single-stranded RNA viruses characterized by an 
ability to convert their RNA to double-stranded DNA in infected cells by a process of 
reverse-transcription (Coffin (1990) Retroviridae and their Replication" In Fields. Kn.pe ed. 
40 Virology. New York: Raven Press). The resulting DNA then stably integrates into cellular 
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chromosomes as a provirus and directs synthesis of viral proteins. The integration resuhs 
n Pretention of the vira, gene sequences in the recipient cell and ts 
retrovira. genome contains three genes, gag. po.. and env that code for capsjda. prote,ns. 
po ymerase enzyme, and envelope components. respective.y. A sequence <°-d 
upstream from the gag gene, termed psi . functions as a signal * "J^^^^T 
into virions. Two long terminal repeat (LTR) sequences are present at the 5 and 3 ends 
of the viral genome. These contain strong promoter and enhancer sequences and are also 
required for integration in the host cell genome (Coffin (1990). SMpra). 

in order to construct a retrovira. vector, a nucleic acid of interest .s .nserted .nto the 
viral genome in the place of certain vira. sequences to produce a virus that « 
replication-defective, .n order to produce virions, a packaging cel. 
pol and env genes but without the LTR and psi components ,s constructed (Mann et a.. 
(1983) Cel. 3^:153). When a recombinant plasmid containing a human cDNA, together w, h 
the retrod ralLTR and psi sequences is introduced into this cel. fine (by ca.aum phosphate 
precfp itation for example), the psi sequence a.iows the RNA transcript of the re~mb,nant 
'pZTto be packaged into virai particles, which are then secreted into the cutture med,a 
(Nico.es and Rubenstein (1988) -Retroviral Vectors". In: Rodriguez anc ^Denhardt ed 
Vectors- A Survey of Molecular Coning Vectors and their Uses. Stoneham:Butterworth. 
Temin. (1986) -Retrovirus Vectors for Gene Transfer Efficient Integration into and 
Egression of Exogenous DMA in Vertebrate Cell Genome", .n: Kuchertapafc ed. Gene 
Transfer New York: Plenum Press; Mann et al.. 1 983. supra). The medrn contam.ng the 
recombinant retroviruses is then collected, optionally concentrated, and used for gene 
transfer Retroviral vectors are able to infect a broad variety of cell types^However 
llglation and stable expression require the division of host ceils (Paskind et a.. (1 975) 

^^Ama^orprerequisite for the use of retroviruses is to ensure the safety of their use. 
particularly with regard to the possibiiity of the spread *wi.d-type virus in the ^cell 
population The development of specialized ce.l lines (termed 'package celis ) wh,ch 
produce only replication^ ective retroviruses has increased the utility of retries , tor 
gene therapy, and defective retroviruses are well characterized for use ,n gene transfer for 
gene therapy purposes (for a review see Miller. A.D. (1990) Blood 76:271) Thus, 
recombinanue^irus can be constructed in which pan of the retroviral ood,ng sequen^ 
(gag. pol. env) has been replaced by nucleic acid encoding a fusion P rote.n of the ^sent 
LmL. rendering the retrovirus replication defective. The replication defecave reuowus 
is then packaged into virions which can be used to infect a target cel. through the use of a 
helper virus by standard techniques. Protocols for producing recombinant retrowuses and 
tor infecting cells in vitro or in vivo with such viruses can be found irHCurrent ^Protocc s ,n 
Molecular Biology. Ausubel. F.M. eta... (eds.) Greene Pubhshmg Associates. (1989). 
Sections 9 10-9 14 and other standard laboratory manuals. Examples of surtable 
, fe™es ^ude pU. pZIP. pWE and pEM which are we., known to those sk,.ed ,n the 
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art A preferred retroviral vector is a pSR MSVtkNeo (Mutter et al. (1 991 ) Mol. Cell B.ol. 
11-1785 and pSR MSV(Xbal) (Sawyers et al. (1995) J. Exp. Med. 181:307) and 
derivatives thereof. For example, the unique BamHI sites in both of these vectors can be 
removed by digesting the vectors with BamHI, filling in with Klenow and religatmg to 
produce P SMTN2 and pSMTX2, respectively, as described in PCT/US96/09948 by 
Clackson et al. Examples of suitable packaging virus lines for preparing both ecotropic 
and amphotropic retroviral systems include Crip, Cre. 2 and Am. 

Retroviruses have been used to introduce a variety of genes into many different 
cell types, including neural cells, epithelial cells, endothelial cells, lymphocytes, myoblasts, 
hepatocytes, bone marrow cells, in vitro and/or in vivo (see for example Egl.t.s et al.. 
(1985) Science 230:1395-1398; Danos and Mulligan. (1988) PNAS USA 85:6460-6464; 
Wilson et al.. (1988) PNAS USA 85:3014-3018; Armentano et al., (1990) PNAS USA 
87-6141-6145; Huberet al., (1991) PNAS USA 88:8039-8043; Ferry et al., (1991) PNAS 
USA 88:8377-8381; Chowdhury et al., (1991) Science 254:1802-1805; van Beusechem et 
al (1992) PNAS USA 89:7640-7644; Kay et al.. (1992) Human Gene Therapy 3:641-647; 
Dai etal (1992) PNAS USA 89:10892-10895; Hwu etal.. (1993) J. Immunol. 150:4104- 
4115- U S Patent No. 4,868.1 16; U.S. Patent No. 4.980.286; PCT Application WO 
89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345: and PCT 

Application WO 92/07573). 

Furthermore, it has been shown that it is possible to limit the infection spectrum of 
retroviruses and consequently of retroviral-based vectors, by modifying the viral 
packaging proteins on the surface of the viral particle (see, for example PCT publ.cations 
W093/25234, WO94/06920, and W094/1 1524). For instance, strategies for the 
modification of the infection spectrum of retroviral vectors include: coupling antibod.es 
specific for cell surface antigens to the viral env protein (Roux et al.. (1989) PNAS USA 
86:9079-9083; Julan etal.. (1992) J. Gen Virol 73:3251-3255; and Goud et al.. (1983) 
Virology 1 63:251 -254); or coupling cell surface ligands to the viral env proteins (Neda et 
al., (1991) J. Biol. Chem. 266:14143-14146). Coupling can be in the form of the chem.cal 
cross-linking with a protein or other variety (e.g. lactose to convert the env protein to an 
asialoglycoprotein), as well as by generating fusion proteins (e.g. single-chain 
antibody/env fusion proteins). This technique, while useful to limit or otherwise d.rect the 
infection to certain tissue types, and can also be used to convert an ecotrop.c vector ,n to 
an amphotropic vector. 

Other Viral Systems 

Other viral vector systems that may have application in gene therapy have been 
derived from herpes virus, e.g.. Herpes Simplex Virus (U.S. Patent No. 5.631 ,236 by Woo 
et al issued May 20. 1 997). vaccinia virus (Ridgeway (1 988) Ridgeway. •Mammahan 
expression vectors." In: Rodriguez R L. Denhardt D T. ed. Vectors: A survey of mo^u.a, 
cloning vectors and their uses. Stoneham: Butterworth,; Baichwal and Sugden (1986) 
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-Vectors for gene transfer derived from animal DNA viruses: Transient and stable 
expression of transferred genes.' In: Kucheriapati R. ed. Gene transfer. New York: Plenum 
Press; Coupar et al. (1988) Gene, 68:1-10), and several RNA viruses. Preferred viruses 
include an alphavirus, a poxvirus, an arena virus, a vaccinia virus, a polio virus, and the 
like. In particular, herpes virus vectors may provide a unique strategy for persistence of 
the recombinant gene in cells of the central nervous system and ocular tissue (Pepose et 
al., (1994) Invest Ophthalmol Vis Sci 35:2662-2666). They offer several attractive 
features for various mammalian cells (Friedmann (1989) Science, 244:1275-1281 ; 
Ridgeway, 1988, supra\ Baichwal and Sugden, 1986, supra; Coupar et al.. 1988; Horwich 
et al.(1990) J.Virol., 64:642-650). 

With the recent recognition of defective hepatitis B viruses, new insight was gained 
into the structure-function relationship of different viral sequences. In vitro studies showed 
that the virus could retain the ability for helper-dependent packaging and reverse 
transcription despite the deletion of up to 80% of its genome (Horwich et al., 1990, supra). 
This suggested that large portions of the genome could be replaced with foreign genetic 
material. The hepatotropism and persistence (integration) were particularly attractive 
properties for liver-directed gene transfer. Chang et al. recently introduced the 
chloramphenicol acetyltransferase (CAT) gene into duck hepatitis B virus genome in the 
place of the polymerase, surface, and pre-surface coding sequences. It was cotransfected 
with wild-type virus into an avian hepatoma ceil line. Culture media containing high titers of 
the recombinant virus were used to infect primary duckling hepatocytes. Stable CAT gene 
expression was detected for at least 24 days after transfection (Chang et al. (1991) 
Hepatology, 14:124A). 

Administration of Viral Vectors 

Generally the DNA or viral particles are transferred to a biologically compatible 
solution or pharmaceutical^ acceptable delivery vehicle, such as sterile saline, or other 
aqueous or non-aqueous isotonic sterile injection solutions or suspensions, numerous 
examples of which are well known in the art, including Ringer's, phosphate buffered saline, 
or other similar vehicles. Delivery of the transgene as naked DNA; as lipid-, liposome-, or 
otherwise formulated DNA; or as a recombinant viral vector is then preferably carried out 
via in vivo, lung-directed, gene therapy. This can be accomplished by various means, 
including nebuKzatiortf nhalation or by instillation via bronchoscopy- Recently, recombinant 
adenovirus encoding CFTR was administered via aerosol to human subjects in a phase I 
dinicai trial. Vector DNA and CFTR expression were clearly detected in the nose and 
airway of these patients with no acute toxic effects (Bellonet al., Human Gene Therapy, 
8(1 ):1 5-25, 1997). 

Preferably, the DNA or recombinant virus is administered insufficient amounts to 
transfect cells within the recipient's airways, including without limitation various airway 
epithelial cells, leukocytes residing within the airways and accessible airway smooth 
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muscle cells, and provide sufficient levels of transgene expression to provide lor 
observable ligand-responsive transcription of a target gene, preferably at a level providing 
therapeutic benefit without undue adverse effects. 

Optimal dosages of DNA or virus depends on a variety of factors, as discussed 
previously, and may thus vary somewhat from patient to patient. Again, therapeutically 
effective doses of viruses are considered to be in the range of about 20 to about 50 ml of 
saline solution containing concentrations of from about 1 X 1 0? to about 1 X 1 d 0 pfu of 
virus/ml, e.g. from 1 X 108 to 1 X 109 pfu of virus/ml. 

In a preferred embodiment, the ratio of viral particle containing a target gene versus 
viral particles containing nucleic acids encoding the fusion proteins of the invention is about 
1-1. However, other ratios can also be used. For example, in certain instances it may be 
desirable to administer twice as many particles having the target gene as those encoding 
the fusion proteins. Other ratios include 1:3. 1:4. 1:10.2:1.3:1.4:1.5:1. 10:1. The optimal 
ratio can be determined by performing in vitro assays using the different ratios of viral 
particles to determine which ratio results in highest expression and lowest background 
expression of the target gene. Similarly, in situations in which the fusion proteins are 
encoded by two different nucleic acids each encapsidated separately, one can vary the 
ratio between the three viral particles, according to the result desired. 

Methods of the invention 

The invention provides methods for engineering cells to render them responsive to 
ligand-mediated regulation of expression of a target gene. The cells may be engineered in 
vitro (ex vivo) or in vivo (i.e., in situ— within an organism). The target gene can be an 
endogenous gene or an exogenous gene (which may be of naturally occurring peptide 
sequence, or may contain non-naturally occurring peptide sequence). The method 
comprises introducing into the cell(s) of interest one or more genetic constructs or 
compositions of this invention. Examples of these methods include the genetic engineering 
of cells or animals (e.g.. mice. rats, etc.) as described herein for use, e.g., in the study of 
normal or pathologic biological processes (including various diseases), for the identification 
or characterization of genes or for the identification of new drugs or the evaluation of drug 
functioning, mechanism or efficacy. Other examples include the delivery of gene therapy to 
human subjects, whether in vivo or ex vivo. 

The invention also provides methods for using such engineered cells, or organisms 
containing them, to carry out the objectives mentioned above and elsewhere herein as well 
as in the cited references. These methods generally involve the application of ligand to the 
engineered cells or organism containing them in order to regulate the expression of a target 
gene. 
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Kits 

This invention further provides kits uselul for the various applications. One such 
kit contains one or more nucleic acids, each encoding a fusion protein of the invention. The 
kit may further comprise an additional nucleic acid comprising a target gene construct 
Alternatively, the additional nudeic acid may contain a cioning she for insertion of a des.red 
target gene by the practitioner. The kit may further contain a sample of a l.gand for 
regulating gene expression using these materials. 



Uses . . 

in one application, cells engineered in accordance with the invent.on are used to 
produce a target protein in vitro. In such applications, the cells are cultured or otherw.se 
maintained until production of the target protein is desired. At that time, the appropnate 
ligand is added to the culture medium, in an amount sufficient to cause the destred level of 
target protein production. The protein so produced may be recovered from the med.um or 
from the cells, and may be purified from other components of the cells or medium as 
desired. 

Proteins for commercial and investigational purposes are often produced using mammalian 
cell lines engineered to express the protein. The use of mammalian cells, rather than 
bacteria, insect or yeast cells, is indicated where the proper function of the prote.n requ.res 
post-translational modifications not generally performed by non-mammal.an cells. Examples 
of proteins produced commercially this way include, among others, erythropo.et.n. BMP-2. 
tissue plasminogen activator. Factor Vllf.c. Factor IX. and antibodies. The cost of 
producing proteins in this fashion is related to the level of expression achieved in the 
engineered cells. Thus, because the invention.described herein can achieve cons.derably 
higher expression levels than conventional expression systems, it may reduce the cost of 
protein production. Toxicity of target protein production can represent a second limitation, 
preventing cells from growing to high density and/or reducing production levels. Therefore, 
the ability to tightly control protein expression, as described herein, permits cells to be 
grown to high density in the absence of protein production. Expression of the target gene 
can be activated and the protein product subsequently harvested, only after an optimum 
cell density is reached, or when otherwise desired. 

In other applications, cells within an animal host or human subject are engineered in 
accordance with the invention, or cells so engineered are introduced into the animal or 
human subject, in either case, to prepare the recipient for ligand-mediated regulation of 
expression of a target gene. In the case of non-human animals, this can be done as part of 
veterinary treatment of the animal or to create an animal model for a vanety of research 
purposes. In the case of human subjects, this can be done as part of a therapeutic or 
prophylactic treatment program. 
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This invention is applicable to a variety of treatment approaches. For example, the 
target gene to be regulated can be an endogenous gene or a heterologous gene, and its 
expression may be activated or repressed by addition of ligand. 

In some cases the target gene is a factor necessary for the proliferation and/or 
differentiation of one or more cell types of interest. For example, it may be desirable to 
stimulate the expression of growth factors and lymphokines in a subject in which at least 
some of the blood cells have been destroyed, e.g.. by radiotherapy or chemotherapy. For 
example, expression of erythropoietin stimulates the production of red blood cells, 
expression of G-CSF stimulates the production of granulocytes, expressionof GM-CSF 
stimulates the prodution of various white blood cells, etc. Similarly in diseases or 
conditions in which one or more specific cell types are destroyed by the disease process, 
e.g.. in autoimmune diseases, the specific cells can be replenished by stimulating 
expression of one or more genes encoding factors stimulating proliferation of these cells. 
The method of the invention can also be used to increase the number of lymphocytes in a 
subject having AIDS, such as by stimulating expression of lymphokines, e.g.. IL-4. which 
stimulates proliferation of certain T helper (Th) cells. 

At least one advantage of increasing the production of an endogenous protein in a 
subject is the absence of an immune reaction against the protein, thus resulting in a more 
efficient treatment of the subject. In some cases of regulated expression of a heterologous 
protein it may be preferable to simultaneously administer to the subject an 
immunosuppressant drug. e.g.. rapamycin. cyclosporin A. FK506 or a mixture of any of the 
foregoing or other compound which represses immune reactions. 

Ceils which have been modified ex vivo with the DNA constructs of the invention 
may be grown in culture under selective conditions and cells which are selected as having 
the desired construct(s) may then be expanded and further analyzed, using, for example, 
the polymerase chain reaction for determining the presence of the construct in the host cells 
and/or assays for the production of the desired gene product(s). Once modified host cells 
have been identified, they may then be used as planned, e.g. grown in culture or 

introduced into a host organism. 

In cases in which the target gene is an endogenous gene of the cells to be 
engineered, the promoter and/or one or more other regions of the gene can be modified to 
include a target sequence that is specifically recognized by the DNA binding domain of a 
fusion protein of this invention so that the endogenous target gene is specifically 
recognized and regulated in a ligand-dependent manner. Such an embodiment can be 
useful in situations in which no DNA binding protein is known to specifically bind to a 
regulatory region of the target gene. Thus, in one embodiment, one or more cells are 
obtained from a subject or other source and geneticalfir engineered in vitro such that a 
desired control element is inserted, operatively linked to the target gene. The cell can then 
be introduced into the subject. Alternatively, prior to introduction of the cell to the subject, 
the cell is further modified to include a nucleic acid encoding a fusion protein compns.ng a 
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DNA binding domain which is capable of interacting specifically with the expression control 
element introduced into the target gene. In other examples of the invention, an 
endogenous gene is modified in vivo by. e.g.. homologous recombination, a technique well 
known in the art, and described, e.g.. in Thomas and Capecchi (1987) Cell 51:503; 
Mansour et al. (1988) Nature 336:348: and Joyner et al. (1989) Nature 338:153. 

A target gene may encode antisense RNA or a ribozyme or other RNA molecule 
which is not translated. For example, the method of the invention can be used to inhibit 
production of one or more specific proteins in a cell of a subject. The availability of potent 
transcriptional activators provided by the invention will ensure that high levels of RNA. 
e.g., antisense RNA, are produced in a cell. 

Other uses for this invention include biological research. The two-hybrid assay is 
a transcription based assay first described by Fields and Song. Nature, 340:245-247 
(1989). See also. Fields et al. US Patent No. 5.283.173 (1 Feb 1994). The two-hybrid 
assay is based on the observation that transcription factors contain separable functional 
modules that direct either DNA binding or transcription activation. A DNA binding domain 
expressed in cells will bind to DNA but not activate transcription as it lacks a transcription 
activation domain. Conversely, a transcription activation domain alone will not effect 
transcription in the absence of directed and/or intimate interaction with DNA such as would 
be provided by a DNA-binding domain. However, if the DNA binding domain and the 
transcription activation domains are each expressed as part of separate fusion proteins, 
and the fusion proteins are capable of associating, the "two-hybrid" complex so formed 
represents a reconstituted transcription factor (see FIG. 1 ). Such a reconstituted 
transcription factor is capable of initiating transcription of a reporter gene (e.g., a gene for a 
conveniently detectable marker such as beta-galactosidase or alkaline phosphatase 
(SEAP) or a protein important for cell viability) located downstream of DNA binding sites 
recognized by the DNA-binding domain. The amount of reporter gene expression, i.e., the 
amount of gene product produced, will reflect the extent to which the fusion proteins 
complex with one another. As described in Example 8. use of the bundling domains of this 
invention to recruit additional activation domains to the complex significantly increases the 
sensitivity of the assay, such that interactions which were previously undetected are now 
clearly visible. 

This dramatic improvement has important ramifications for a variety of applications 
of the 2-hybrid methodology, including those aimed at identifying genes of interest, at 
identifying peptide binding partners, and at identifying inhibitors of a protein-protein 
interaction of interest 

For instance, to identify genes of interest, e.g. cDNAs from a cDNA library, the genes 
are cloned into a construct designed to express the encoded polypeptides as fusion 
proteins linked to a bundling domain and to a transcription activation domain. As an 
example of the design of such constructs, one may start with a construct encoding a fusion 
protein such as an RLS fusion protein depicted in Fig 3. but replace the DNA sequence 
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encoding a ligand binding domain with a cloning site for th insertion of the cDNAs. The 
constructs (bearing the cDNA inserts) are introduced into host cells containing (or 
subsequently made to contain) (i) a nucleic acid encoding a fusion protein containing a 
DNA binding domain and a target domain of interest, and (ii) a reporter gene construct 
containing a recognition sequence for the DNA binding domain operably linked to a gene 
which encodes a detectable gene product or which is otherwise responsible for a 
detectable phenotype. Cells expressing a fusion protein containing a cDNA-encoded 
domain which binds to the target domain of interest express the reporter gene construct. 
The corresponding cDNA can thus be identified based on the fact that the protein it 
encodes binds to the target domain of interest. Potential advantages include the enhanced 
ability to detect and identify less abundant cDNAs, cDNAs which are expressed at lower 
levels relative to other cDNAs, cDNAs encoding gene products which bind to the target 

with relatively lower affinity, etc. 

In another 2-hybrid application, a collection of polypeptides may be expressed as 
fusion proteins using nucleic acid constructs encoding the desired collection of 
polypeptides in place of the cDNAs in the previous example. Peptide sequences which 
bind to a target protein or domain of interest may thus be identified. 

Another such application involves assays for identifying inhibitors of protein:protein 
interactions of interest In such assays a host cell is engineered to express two fusion 
proteins, the first containing a DNA binding domain and a first protein domain of interest, the 
second fusion protein containing a transcription activation domain, a bundling domain and a 
second protein domain of interest which binds to the first protein domain of interest. The 
cells also contain a reporter gene construct as described above. Because the two fusion 
proteins bind to one another, the reporter gene is normally expressed. Such cells may be 
used to identify compounds which inhibit the protein:protein interaction, for instance in a 
drug screening program. Thus, cells containing fusion proteins of this invention may be 
contacted with one or more compounds to be tested. The presence or amount of reporter 
gene product is then measured. A decrease in reporter expression in the presence of a 
substance, as compared to expression in the presence of less or none of the substance, 
indicates that the substance inhibited the protein:protein interaction. For additional details 
on the design and implementation of such assays which can be adapted to this invention, 
see e.g. WO 95/24419. Substances for testing may be obtained from a wide variety of 
sources, including without limitation, microbial broths, cellular extracts, conditioned media 
from cells, combinatorial libraries and other sources of naturally-occurring or synthetic 
compounds. 
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Pharmaceutical Compositions & Their Administration to Subjects Containing 
Engineered Cells 

Administration 

The ligand may be administered to a human or non-human subject using 
pharmaceutical^ acceptable materials and methods of administration. Various 
formulations, routes of administration, dose and dosing schedule may be used for the 
administration of ligand. depending upon factors such as the binding affinity of the ligand tor 
the ligand binding domain, the choice of transcription regulatory domains, the condition and 
cirmcumstances of the recipient, the response desired, the biological half-life and 
bioavailability of the ligand, the biological half-life and specific activity of the target gene 
product, the number and location of engineered cells present, etc. The drug may be 
administered parenterally. or more preferably orally. Dosage and frequency of 
administration will depend upon factors such as described above. The drug may be taken 
orally as a pill, powder, or dispersion; bucally; sublingually; injected intravasculariy. 
intraperitoneally. subcutaneously; or the like. The drug (and antagonists, as discussed 
below) may be formulated using conventional methods and materials well known in the art 
for the various routes of administration. The precise dose and particular method of 
administration will depend upon the above factors and be determined by the attending 
physician or healthcare provider. However, we show here that in the presence of bundled 
activation domains, the amount of drug needed to oligomerize the fusion proteins of this 
system is greatly reduced, by an order of magnitude or more. 

The particular dosage of the drug for any application may be determined in 
accordance with conventional approaches and.procedures for therapeutic dosage 
monitoring. A dose of the drug within a predetermined range is given and the patient's 
response is monitored so that the level of therapeutic response and the relationship of 
target gene expression level over time may be determined. Depending on the expression 
levels observed during the time period and the therapeutic response, one may adjust the 
level of subsequent dosing to alter the resultant expression level over time or to otherwise 
improve the therapeutic response. This process may be iteratively repeated until the 
dosage is optimized for therapeutic response. Where the drug is to be administered 
chronically, once a maintenance dosage of the drug has been determined, one may conduct 
periodic follow-up monitoring to assure that the overall therapeutic response continues to 
be achieved. 

In the event that the activation by the drug is to be reversed, administration of drug 
may be suspended so that cells return to a basal rate of proliferation. To effect a more 
active reversal of therapy, an antagonist of the drug may be administered. An antagonist is 
a compound which binds to the drug or drug-binding domain to inhibit interaction of the drug 
with the fusion protein(s) and thus inhibit the downstream biological event. Antagonists 
include drug analogs, homologs or components which are monovalent with respect to the 
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fusion proteins. Such compounds bind to the fusion proteins but do not support clustering 
of the fusion proteins as is required for activation of signaling. Thus, in the case of an 
adverse reaction or the desire to terminate the therapeutic effect, an antagonist can be 
administered in any convenient way. particularty intravascular^ or by 
inhalation/nebulization, if a rapid reversal is desired. 

Compositions 

Drugs (i e the ligands) for use in this invention can exist in free form or. where 
appropriate, in salt form. The preparation of a wide variety of pharmaceutical^ acceptable 
salts is well-known to those of skill in the art. Pharmaceutical* acceptable salts of venous 
compounds include the conventional non-toxic salts or the quaternary ammonium salts of 
such compounds which are formed, for example, from inorganic or organic acids of bases. 

The drugs may form hydrates or solvates. It is known to those of skill in the art that 
charged compounds form hydrated species when lyophilized with water, or form solvated 
species when concentrated in a solution with an appropriate organic solvent. 

The drugs can also be administered as pharmaceutical compositions compnsing a 
therapeutically (or prophylactically) effective amount of the drug, and a pharmaceutical* 
acceptable carrier or excipient Carriers include e.g. saline, buffered saline, dextrose, water, 
glycerol ethanol. and combinations thereof, and are discussed in greater detail below. The 
composition, if desired, can also contain minor amounts of wetting or emulsifying agents, or 
pH buffering agents. The composition can be a liquid solution, suspension. emuls.on. 
tablet pill, capsule, sustained release formulation, or powder. The composition can be 
formulated as a suppository, with traditional binders and carriers such as triglycendes. Oral 
formulation can include standard carriers such as pharmaceutical grades of mann.tol. 
lactose, starch, magnesium stearate. sodium saccharine, cellulose, magnesium carbonate, 
etc. Formulation may involve mixing, granulating and compressing or d.sso!v.ng the 
ingredients as appropriate to the desired preparation. 
" The pharmaceutical carrier employed may be. for example, either a solid or liquid. 
Illustrative solid carriers include lactose, terra alba, sucrose, talc, gelatin, agar, pectin, 
acacia, magnesium stearate. stearic acid and the like. A solid carrier can include one or more 
substances which may also act as flavoring agents, lubricants, solubilizers. suspending 
agents, fillers, glidants, compression aids, binders or tablet-disintegrating agents: it can 
also be an encapsulating material. In powders, the carrier is a finely divided solid which is 
in admixture with the finely divided active ingredient In tablets, the active ingredient is 
mixed with a carrier having the necessary compression properties in suitable proportions 
and compacted in the shape and size desired. The powders and tablets preferably contain 
up to 99% of the active ingredient. Suitable solid carriers include, for example, calcium 
phosphate, magnesium stearate, talc, sugars, lactose, dextrin, starch, gelatin, cellulose, 
methyl cellulose, sodium carboxymethyl cellulose, polyvinylpyrrolidine. low melting waxes 
and ion exchange resins. 
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Illustrative liquid carriers include syrup, peanut oil, olive oil, water, etc. Liquid 
carriers are used in preparing solutions, suspensions, emulsions, syrups, elixirs and 
pressurized compositions. The active ingredient can be dissolved or suspended in a 
pharmaceutically acceptable liquid carrier such as water, an organic solvent, a mixture of 
both or pharmaceutically acceptable oils or fats. The liquid carrier can contain other suitable 
pharmaceutical additives such as solubilizers, emulsifiers, buffers, preservatives, 
sweeteners, flavoring agents, suspending agents, thickening agents, colors, viscosity 
regulators, stabilizers or osmo-regulators. Suitable examples of liquid carriers for oral and 
parenteral administration include water (partially containing additives as above, e.g. 
cellulose derivatives, prelerably sodium carboxymethyl cellulose solution), alcohols 
(including monohydric alcohols and polyhydric alcohols, e.g. glycols) and their derivatives, 
and oils (e.g. fractionated coconut oil and arachis oil). For parenteral administration, the 
carrier can also be an oily ester such as ethyl oleate and isopropyl myristate. Sterile liquid 
carders are useful in sterile liquid form compositions for parenteral administration. The liquid 
carrier for pressurized compositions can be halogenated hydrocarbon or other 
pharmaceutically acceptable propellant. Liquid pharmaceutical compositions which are 
sterile solutions or suspensions can be utilized by, for example, intramuscular, 
intraperitoneal or subcutaneous injection. Sterile solutions can also be administered 
intravenously. The drugs can also be administered orally either in liquid or solid composition 
form. 

The carrier or excipient may include time delay material well known to the art. such 
as glyceryl monostearate or glyceryl distearate along or with a wax, ethylcellulose, 
hydroxypropylmethyicellulose, methylmethacrylate and the like. When formulated for oral 
administration, 0.01% Tween 80 in PHOSAL PG-50 (phospholipid concentrate with 
1 ,2-propylene glycol. A. Nattermann & Cie. GmbH) may be used as an oral formulation for 
a variety of drugs for use in the practice of this invention. 

A wide variety of pharmaceutical forms can be employed. If a solid carrier is used, 
the preparation can be tableted. placed in a hard gelatin capsule in powder or pellet form or 
in the form of a troche or lozenge. The amount of solid carrier will vary widely but 
preferably will be from about 25 mg to about 1 g. If a liquid carrier is used, the preparation 
will be in the form of a syrup, emulsion, soft gelatin capsule, sterile injectable solution or 
suspension in an ampule or vial or nonaqueous liquid suspension. 

To obtain a stable water soluble dosage form, a pharmaceutically acceptable salt of 
the drug may be dissolved in an aqueous solution of an organic or inorganic acid, such as a 
i 0.3M solution of succinic acid or citric acid. Alternatively, acidic derivatives can be dissolved 
in suitable basic solutions. If a soluble salt form is not available, the compound is 
dissolved in a suitable cosoivent or combinations thereof. Examples of such suitable 
dissolved in a suitable cosoivent or combinations thereof. Examples of such suitable 
cosolvents include, but are not limited to. alcohol, propylene glycol, polyethylene glycol 
o 300. polysorbate 80. glycerin, polyoxyethylated fatty acids, fatty alcohols or glycenn 
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hydroxy fatty acids esters and the like in concentrations ranging from 0-60% of the total 
volume. 

Various delivery systems are known and can be used to administer the drugs, or 
the various formulations thereof, including tablets, capsules, injectable solutions, 
encapsulation in liposomes, micropartides. microcapsules, etc. Preferred routes of 
administration to a patient are oral, sublingual and bucal. Methods of introduction also could 
include but are not limited to dermal, intradermal, intramuscular, intraperitoneal, intravenous, 
subcutaneous, intranasal, pulmonary, epidural, ocular and (as is usually preferred) oral 
routes. The drug may be administered by any convenient or otherwise appropriate route, 
for example by infusion or bolus injection, by absorption through epithelial or 
mucocutaneous linings (.e.g., oral mucosa, rectal and intestinal mucosa, etc.) and may be 
administered together with other biologically active agents. Administration can be systemic 
or local. For ex vivo applications, the drug will be delivered as a liquid solution to the 
cellular composition. 

In a specific embodiment, the composition is formulated in accordance with routine 
procedures as a pharmaceutical composition adapted for intravenous administration to 
human beings. Typically, compositions for intravenous administration are solutions in 
sterile isotonic aqueous buffer. Where necessary, the composition may also include a 
solubiiizing agent and a local anesthetic to ease pain at the side of the injection. Generally, 
the ingredients are supplied either separately or mixed together in unit dosage form, for 
example, as a lyophilized powder or water free concentrate in a hermetically sealed 
container such as an ampoule or sachette indicating the quantity of active agent. Where the 
composition is to be administered by infusion, it can be dispensed with an infusion bottle 
containing sterile pharmaceutical grade water or saline. Where the composition is 
administered by injection, an ampoule of sterile water for injection or saline can be provided 
so that the ingredients may be mixed prior to administration. 

In addition, in certain instances, it is expected that the compound may be disposed 
within devices placed upon, in, or under the skin. Such devices include patches, implants, 
and injections which release the compound into the skin, by either passive or active 
release mechanisms. 

Materials and methods for producing the various formulations are well known in the 
art and may be adapted for practicing the subject invention. See e.g. US Patent Nos. 
5,182,293 and 4,837,31 1 (tablets, capsules and other oral formulations as well as 
intravenous formulations) and European Patent Application Publication Nos. 0 649 659 
(published April 26. 1995; rapamycin formulation for IV administration) and 0 648 494 
(published April 19, 1995; rapamycin formulation for oral administration). 

The effective dose of the drug will typically be in the range of about 0.01 to about 
50 mg/kgs, preferably about 0.1 to about 10 mg/kg of mammalian body weight, 
administered in single or multiple doses. Generally, the compound may be administered to 
patients in need of such treatment in a daily dose range of about 1 to about 2000 mg per 
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patient In mbodiments in which the compound is rapamycin or an analog thereof with 
some residual immunosuppressive effects, it is preferred that the dose administered be 
below that associated with undue immunosuppressive eflects. 

The amount of a given drug which will be effective in the treatment or prevention of 
a particular disorder or condition will depend in part on the severity of the disorder or 
condition, and can be determined by standard clinical techniques. In addition, in vitro or in 
vivo assays may optionally be employed to help identify optimal dosage ranges. 
Effective doses may be extrapolated from dose-response curves derived from in vitro or 
animal model test systems. The precise dosage level should be determined by the 
attending physician or other health care provider and will depend upon well known factors, 
including route of administration, and the age, body weight, sex and general health of the 
individual; the nature, severity and clinical stage of the disease; the use (or not) of 
concomitant therapies; and the nature and extent of genetic engineering of cells in the 
patient 

The drugs can also be provided in a pharmaceutical pack or kit comprising one or 
more containers filled with one or more of the ingredients of the pharmaceutical 
compositions. Optionally associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceutical or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 



The full contents of all references cited in this document, including references from 
the scientific literature, issued patents and published patent applications, are hereby 
expressly incorporated by reference. 

The following examples contain important additional information, exemplification and 
guidance which can be adapted to the practice of this invention in its various embodiments 
and the equivalents thereof. The examples are offered by way illustration should not be 
construed as limiting in any way. As noted throughout this document, the invention is 
broadly applicable and permits a wide range of design choices by the practitioner. 

The practice of this invention will employ, unless otherwise indicated, conventional 
techniques of ceil biology, cell culture, molecular biology, transgenic biology, microbiology, 
recombinant DNA, immunology, virology, pharmacology, chemistry, and pharmaceutical 
formulation and administration which are within the skill of the art Such techniques are 
explained fully in the literature. See, for example. Molecular Cloning A Laboratory Manual. 
2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 
1989); DNA Cloning. Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis 
(M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683.195; Nucleic Acid Hybridization 
(B. D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & 



62 



WO 99/10510 



PCT/US98/17723 



S J Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss. Inc.. 1987); 
Immobilized Cells And Enzymes (IRL Press, 1986); B. Pertoal. A Practical Guide To 
Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic Press, Inc., 
N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds.. 
1987! Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu 
et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, 
eds.. Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes l- 
IV (D. M. Weir and C. C. Blackwell, eds.. 1986); Manipulating the Mouse Embryo, (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 
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Examples 

Example 1: Construction of plasmids encoding bundled activation domains: 
Transcription factor fusion proteins were expressed from pCGNN (Attar, R.M. & Oilman. 
M Z (1992) Expression cloning of a novel zinc-finger protein that binds to the c-fos serum 
response element. Mol. Cell. Biol. 12, 2432-2443). Inserts cloned into pCGNN as Xbat- 
BamHI fragments are transcribed under control of the human CMV enhancer and promoter 
and are expressed with an amino-terminal epitope tag (a 16-amino acid port.on of the 
Haemophilus influenzae hemagglutinin gene) and nuclear localization sequence from the 
SV40 large T antigen. Individual components of the transcription factors were synthes.zed 
by polymerase chain reaction as fragments containing an Xbal site immediately upstream 
of the first codon and a Spel site, an in-frame stop codon. and a BamHl site immediately 
downstream of the last codon. Fusion proteins comprising multiple component were 
assembled by stepwise insertion of Xbal-BamHI fragments into Spel/BamHI-opened 
vectors. The individual components used and their abbreviations are as follows: 

G = yeast Gal4 DNA binding domain, amino acids 1-94 
F = human FKBP12, amino acids 1-107 

r = FRB domain of human FRAP, amino acids 2025-21 13 

S = activation domain from the P 65 subunrt of human NF-kB amino acids 361-550 
V = activation domain from Herpesvirus VP 1 6 .amino aads 410-494 

L - E coii lactose repressor, amino acids 46-360 . _^ . 

MT = MiSraT Tetramerization domain of E. coii lactose repressor, ammo acds 324-360 

For example pCGNN-GF2 was made by insertion of the Gal4 DNA binding domain into 
pCGNN to generate pCGNN-G, followed by the sequential insertion of 2 FKBP domains. 
PCGNN-L was made inserting the Xba1/BamH1 digested PCR fragments of lactose 
repressor coding sequences (amino acids 46-360) into PCGNN vector. PCGNN- LS was 
made by inserting P 65 activation domain (amino acids 361-550) into Spe1 and BamHl 
digested PCGNN-l_expression plasmid. PCGNN-G AL4 CB was made by insert.ng Xba1 
and BamHl digested fragments of c-CBL sequences into Spe1 and BamHl digested 
PCGNN-GAL4 expression plasmid. PCGNN-MA was made by inserting Xba1 and 
BamHl digested DNA fragments containing SH3 domain coding sequences into 
Xba1/BamH1 digested PCGNN. PCGNN-MAS and PCGNN-MAMTS were made by 
inserting the S ( P 65 activation domain) and MTS (minimal tetramerization domain fused to 
P 65 activation domain) respectively into Spe1/BamH1 digested PCGNN-MA vector. 
5xGAL4-IL2-SEAP contains 5 GAL4 sites upstream of a minimal IL2 promoter dnving 
expression of the SEAP gene (a gift of J. Morgenstem and S. Ho). The retroviral vector 
P LH-5xGal4-IL2-SEAP was constructed by cloning the 5xGAL4-ll_2-SEAP fragment 
described above into the vector pLH (Rivera et al. 1996. Nature Medicine 2:1028-1032; 
Natesan et al. Nature 1997 Nov 27 390:6658 349-50). which also contains the hygromycin 
B resistance gene driven by the Moloney murine leukemia virus long terminal repeat 
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Example 2: Generation of stable cell lines: 

To generate cells containing the P LH-5xGAL4-lL2-SEAP reporter stab.y ^ffv 
helper-free retrovirus, generated as described (Rivera et al. 1996; Natesan et al, 1997). 
was used to infect HT1080 cells. Hundreds of hygromycin B (300 mg/ml) res.stant clones 
were pooled (HT1080 B pool) and individual clones screened by transient transfection 
with pCG-GS. The most responsive clone. HT1080B. was selected for further analysis. 

Example 3: Transient Transfections 

HT1080 cells were grown at 3?o C in MEM medium containing 10% fetal calf serum, non- 
essential amino acids and penicillin-streptomycin. Twenty-four hours before transfection. 
approximately 2 x 105 cells were seeded in each well in a 12-well plate. Cells were 
transfected using Lipofectamine as recommended (Gibco BRL). Cells in each well 
received the amounts plasmids indicated in the figure, with or without 400 ng of reporter 
plasmid with the total amount of DNA being adjusted to 1 .25 ug with pUC19. For 
experiments shown in Fig. 5. 10 ng of plasmid expressing DNA binding domain fusions 
and increasing amounts of plasmid expressing P 65 activation domain fusions were 
included. After transfection for five hrs. the medium was removed and 1 ml of fresh medium 
added 18-24 hrs later. 100 ul medium was removed and assayed tor SEAP activity us.ng 
a Luminescence Spectrometer (Perkin Elmer) at 350 nm excitation and 450 nm emission. 
Where indicated. 2-5 ul of medium was also assayed for hGH protein as recommended 
(Nichols Diagnostic). 

Example 4: Delivery of bundled activation domains to the GAL4 DNA binding 

domain . . - 

The basic system used for regulated gene expression (Fig. 1 A)involves two fusion 
prcteins. one containing a DNA-binding domain (such as GAL4) fused to a single copy of 
FKBP12 and the other containing a transcription activation domain (such as from the pob 
subunit of NF-kB) fused to the FRB domain of FRAP (see e.g.. Rivera et al). In the 
presence of the natural-product rapamycin. which forms a high affinity complex with FKBP 
and FRB domains, the FRB-p65 fusion protein is efficiently recruited to the GAL4-FKBP 
fusion protein. This basic system results in the delivery of a maximum of one p65 
activation domain per DNA binding domain monomer (Fig. 1 A). In this system the number of 
activation domains delivered to the promoter can be increased by fusing multiple FKBP 
moieties to GAL4. allowing each DNA binding domain to recruit multiple FRB-p65 activat.on 
domain fusions (Fig. 1 B). Because the fusion protein containing the activation domain is 
expressed separately in this system, it is possible to bundle activation domain fusion 
proteins and deliver them to FKBP moieties linked to the GAL4 DNA binding doma,n. For 
example, the addition of a tetramerization domain present in the E. coli lactose repressor 
) between the FRB and activation domains should generate a fusion protein "bundle 
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comprising of four activation domains and FRB domains, which in the presence of 
"dimerizer" can be delivered to ach FKBP moiety (Fig. 1 C). In the configuration dep.cted .n 
Fig 1D rapamycin mediates the recruitment of a tetrameric complex of bundled acWation 
domain fusion proteins to each FKBP of a Gal4-4xFKBP fusion protein, permitting 
recruitment of up to sixteen P 65 activation domains to a single GAL4 monomer. Analogous 
improvements on allostery-based systems, also based on bundling, are shown .n F.gs 1E 
- 1H. 

Example 5: Transcriptional activation is proportional to the number of activation 
domains bound to the promoter. 

To test how bundled activation domain fusion proteins function in this system, we 
transfected HT1080 B cells with plasmids expressing various transcription factor fusion 
proteins and treated the cells with 10 nM rapamycin to deliver the activation domains to the 
promoter. We observed that when only one RS or RLS fusion protein is delivered to each 
GAL4 monomer (GF1 + RS and GF1+RLS), bundled activation domain fusion proteins 
induced the reporter gene strongly as compared to the unbundled activation domain njs.on 
proteins. This finding suggests that bundled activation domain fusion proteins, because of 
their ability to deliver more activation domains to the promoter, function as highly potent 
inducers of transcription. Furthermore, our studies using various combinations of DMA 
binding fusion proteins and activation domain fusion proteins revealed that the level of 
reporter gene expression is roughly linear with the number of activation domains that can 
be delivered to a single GAL4 monomer bound to its promoter (Fig. 2A). 

The RLS fusion protein is capable of delivering four times more p65 activation 
domain to the promoter than its unbundled counterpart. RS. In theory. FRB fusion protein 
containing four tandemfy reiterated p65 activation domain (RS4) should deliver same 
number of activation domains to the promoter as RLS and therefore should have s,milar 
transactivation capacity. To examine whether RS4 can function in a manner similar to RLS 
in the rapamycin regulated gene expression system , we transfected expression plasm.ds 
encoding the DNA binding receptor. GF1 , together with RS4 or RLS fusion proteins .nto 
HT1080 B cells and analyzed the expression of the integrated reporter gene by adding 10 
nM rapamycin to the medium. We found that rapamycin induced the reporter gene strongly 
in cells expressing the GFIand RLS but not the GFIand RS4 combination of fusion 
proteins, indicating that the reiterated P 65 activation domains are weak inducers of 
transcription in the d.menzer system (Fig^B). In contrast, rapamycin was able to mduce 
reporter gene expression in the presence of the GF3 and RS4 combination of fusion 
proteins, albiet at much lower levels than the GF1/RLS combination of proteins. W,thout 
being limited to a particular theory. GF3 fusion proteins should recruit three times more 
activation domains to the promoter than GF1 . The finding that RS4 fusion protein can 
induce transcriptional activation much more strongly when tethered to GF3 as compared to 
GF1 suggests that when the concentration of activation domain fusion protein ,s ve-y low. 
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more activation domains can be recruited to the promoter by increasing the number of 
FKBP moieties tused to the GAL4 DNA binding domain. A western blot analysis of the 
intracellular levels of the transfected proteins revealed that the amount of RS4 in the cell ,s 
below the level of detection, which may explain why it acts as a poor .nducer of 
transcription. These observations strongly suggest that the bundling strategy, unlike 
reiteration, generates highly potent activation domains that are less toxic to cells. 

One possible explanation for pan or all of the robust induction of gene expression 
by RLS fusion proteins is that the close proximity of four FRB moities in the RLS bundle 
produces an avidity effect. To test this, we devised a strategy as illustrated in Rg. 3* In 
theory, co-expressing a limited amount of RLS in the presence of a large excess of LS 
fusion protein should promote the formation of RLS bundles containing, at most, a single 
FRB domatt: To examine the consequences of reducing the number of FRB domams ,n the 
RLS bundle on reporter gene expression, we co-transfected HT1080 B cells with relevant 
expression plasmids and analyzed the expression of the GAL4 responsive gene in the 
presence of 10 nM rapamycin in the medium. As previously observed (see F.g. 2A). 
rapamycin induced only low levels of reporter gene expression in cells expressing GF1 
and RS fusion proteins. However, reporter gene expression was very robust ,n cells 
expressing GF1 and RLS fusion proteins (Fig. 3B). To our surprise, in cells expressing 
GF1, a limited amount of RLS and a large excess of LS fusion protein, rapamycin induced 
reporter gene expression to even higher levels than those achieved by GF1 and RLS 
fusion proteins alone (F.g. 3B). This suggests that the strong stimulation of gene 
expression by RLS fusion proteins is not dependent on the presence of multiple FRB 
domains in the bund.e. Indeed, the data shown here indicates that the presence of multip.e 
FRB domains in RLS fusion protein actually diminishes its capacity to activate gene 
expression to the maximum possible level. It is likely that rapamycin allows multiple FRB 
domains in the RLS to make contact with more than one GAL4-FKBP monomer bound to 
the promoter, effectively reducing the number of activation domains delivered. However . 
RLS bundles with a single FRB domain can make contact with only a single GAL4-FKBP 
monomer and therefore can recmit greater number of activation domains to the promoter, 
leading to a slight increase in the target gene expression. 

To assess the consequences of reducing the number of activation domains in the 
RLS fusion protein, we expressed excess amounts of lactose repressor reg.on (L. ammo 
acids 46-340) relative to RLS. together with the DNA binding protein GF1 and induced 
reporter gene expression by adding 10 nM rapamycin to the medium. In this situation, the 
tetrameric bundles formed should contain a maximum of one activation doma.n and one 
FRB domain. Because reducing the number of FRB domains in the RLS bundle mcreased 
reporter gene expression, any inhibition of reporter gene expression in the presence of 
excess L region relative to RLS can be attributed to a decline in the number of activation 
domains recruited to the promoter. The data in F,g. 3B show that an excess of a portion of 
> the lactose repressor inhibits rapamycin-induced reporter gene expression in cells 



69 



WO 99/10510 



PCI7US98/17723 



expressing GF1 and RLS fusion proteins. A western blot analysis of the recombinant 
proteins in the transfected cells shows a good correlation between the amount of plasm.d 
used in the transtection and the corresponding expression level of protein. Taken together, 
these observations strongly suggest that the RLS fusion proteins function as potent 
inducers of transcription primarily because of their ability to deliver significantly more 
activation domains to the promoter. 

Example 6: Activation of transcription using a minimal tetramerization domain 

and synergizing activation domains 

The experiments described used the lactose repressor (minus its DNA b.nd.ng 
domain) as the bundling domain in fusino proteins also containing the FRB and act.vat.on 
domains. In addition to the tetramerization domain, this portion of lactose repressor contains 
the lactose binding domain and the flanking linker regions. To determine whether the 
tetramerization domain of lactose repressor alone is sufficient for bundling fusion proteins, 
we made an expression plasmid. RMTS. in which the lactose repressor coding sequences 
(amino acids 46-360) in the RLS fusion protein was replaced with a thirty-six ammo aad 
region between amino acids 324 and 360 containing the tetramerization domain and a 
portion of upstream linker region (MT). We have found that combination of P 65 and VP16 
activation domains when fused to GAL4 DNA binding domain synergistic^ induced GAL 
responsive genes. To examine whether they behave similarly when bundled together 
using the minimal lactose repressor minimal tetramerization domain, we generated two 
additional plasmids. RMTSV and RMTV in which the VP1 6 activation domain (ammo acids 
419-490) was fused to RMTS or RMT respectively. We then co-transfected plasm.ds 
expressing appropriate combinations of fusion proteins (Fig. 4) into HT1080 B cells 
carrying a stably integrated GAL4 responsive reporter gene and treated the cells with 
rapamycin to stimulate target gene expression. We observed that in cells expressing 
GF4/RMTSV and GF4/RMTS combination of fusion proteins, rapamycin induced the 
reporter gene expression to roughly six and three fold higher than GF4/RS comb.nat.on of 
fusion proteins. In cells expressing GF4/RMTV or GF4/RSV combinations of fus,on 
proteins, rapamycin induced the reporter gene only marginally higher than the levels 
induced by GF4/RS fusion proteins (Fig. 4). Although the fold induction of reporter gene 
expression by GF4/RMTS and GF4/RMTSV is slightly lower than GF4/RLS and 
GF4/RLSV three and six fold compared to four and eight fold respectively (see figure 2A). 
strong stimulation of gene expression by the activation domain fusion proteins containing 
the lactose repressor minimal tetramerization domain suggest that the minimal 
tetramerization domain is sufficient to bundle fusion proteins. 
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Example 7: Bundling reduces the threshold number of activators required to 
induce peak levels of gen expression: 

If the strong stimulation of gene expression induced by the bundled fus.on prote.ns 
containing p65 activation domains is simply due to their ability to deliver more action 
domains to the promoter, a lower .eve. of fusion protein containing ^ -c^n dor^n 
should be sufficient in the case of bundling, as compared to unbundled act.vat.on doma,ns. 
to strongly stimulate reporter gene expression. In the dimerizer system, the number of 
reconstituted activators formed can be controlled either by adjusting the amount of 
activation domain fusion proteins or by varying the amount of rapamycin added to^ the 
medium. We have employed both of these complementary approaches to address the 
question of whether bundling of activation domains reduces the threshold amount of 
abators required for robust expression of the reporter gene. In the f.rst approach. vary.ng 
amounts of bundled activation domains. RMTS and RMTSV, or their unbundled 
counterpart. RS. were expressed in HT1080 B cells together with a fixed amount of GF4 
the DNA binding receptor (Fig. 5A V The activators were reconstituted by the add-on of 10 
nM rapamycin to the medium. The level of recombinant proteins expressed .n the 
transfeoted cells was determined by western blot analysis (Fig. 5B). At the lo^sUeve of 
activation domains expressed, rapamycin failed to induce transition of the reporter gene 
in cells expressing the GF4 + RS combination of fusion ^^^'^f^ 
robust activation of reporter gene expression in cells conta,n.ng the GF4+RMTS or 
RMTSV combination of fusion proteins. When the activation domain fusion protein* > were 
present at high levels, rapamycin induced reporter gene expression to approximately four- 
and two-fold higher levels in cells containing the GF4+RMTSV and GF4-RMTS 
comoTation of fusion proteins, respectively, as compared to GF4 + RS fus.on proteins, 
indeed, the ieve. of reporter gene expression induced by the lowest amounts of RMTSV 
exceeded the level stimulated by the highest amount of RS fusion prote.ns .n the cell (F,g. 
5A) These observations suggest that peak levels of reporter gene expression can be 
achieved with fewer reconstituted activators containing bundled activation domains than 

with their unbundled counterparts. 

m the second complementary approach, we transfected HT1080 B ceils with a fixed 
amount of the expression p.asmids used in figure SB and induced the reconstitute , o the 
activators by adding varying amounts of rapamycin to the medium. In the presence of the 
GF4 DNA binding receptor, both RMTSV and RMTS fusion proteins induced the reporter 
gene expression robustly at 1 nM rapamycin in the medium. At this concentration of 
, rapamycin in the medium, the GF4 + RS combination of fusion proteins failed to induce , the 
reporter gene significantly above background levels. In all cases, we observed peak 
levels of reporter gene expression in the presence of 10 nM rapamycin in the medium £g. 
5B) Collectively the finding that relatively low numbers of activators containing mutople 
bundled activation domains are sufficient to strongly induce gene expression suggests that 
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the threshold amount of activators required for peak levels of gene expression can be 
significantly lowered by incr asing the potency of activators. 

Example 8: Bundling activation domain fusion proteins in the two-hybrid system 

"a' f!nd*g t~t expression of target genes can be achieved in the presence 
of relatively few reconstituted activators containing bundled, but not unbundled, activation 
Proteins has important implications in two-hybrid assays. Although the twc- 
hybrid system is a highly sensitive assay to detect protein-protein interactions ,n vwo a 
number of factors may curtail the interaction between two hybrid prote,ns expressed n the 
cel. One frequently faced problem with the two hybrid system is that eukaryotic cells, 
because of their highly conserved biochemica. regulatory pathways, often exh,brt .poor 
tolerance to high levels of the hybrid proteins, particularly those containing «"^*™ 6 
activation domain, resulting in the very poor expression of fus,on prote.ns ,n these ceUs. or 
in some cases, cell death. Because the success of this assay is dependent on the two 
hybrid proteins finding each other, it is essential that one or both of the hvbnd prote-ns 
preferably.the fusion protein containing the activation domain, is present at relatively h.gh 
amounts to promote the interaction between the two hybnd prote.ns. 

To examine whether the use of bundled activation domain fusion proteins would 
allow detection of protein-protein interactions that were previously undetectable m 
mammalian two-hybrid assays, we chose to study the interaction between two proteins. 
Tamely, the pro toincogene C-Cb, and the C-Src SH3. The pro.ine-nch dorna.ns of *e 
C-Cb. proto-oncogene have been shown to bind to the SH3 domains of a numb* o< 
signaling proteins both in in vtfro and in yeast two-hybrid assays. However, n mammal.an 
two-hybrid experiments, the GAL4-CBL and Src SH3-VP1 6 hybrid proteins farted to 
induce the expression of a stably integrated reporter gene. To examme whether 
expressing "bundled" Src SH3-activation domain fusion protein together w,th GAL4-CBL 
SZto the GAL4 responsive gene, we made appropriate plasmids for express.ng 
the fusion proteins shown schematically in F,g. 6A and B, and introduced relevant 
combinations of expression plasmids into HT1080 B cells by transient Section .We 
observed that neither GCBL alone, nor GCBL in the presence SH3-VP16 or SH3-P65 
induced the reporter gene expression to detectable levels. However. ,n the presence of 
the bundled fusion proteins . SH3-LVP16 or SH3-Lp65. GCBL induced the reporter gene 
very strongly. These results show that the use of bundled activation domam fus,on prolem 
can significantly improve the sensitivity of the two-hybrid assay (F ig. 6^To assess 
whether the unbundled activation domain fusion proteins fail to .nduce the reporter gene 
expression due to their low intracellular levels, we carried out western blot analysis of 
lysates from the transfected cells. A representative western blot shown .n F,g. 6C 
Ltrates that the unbundled fusion proteins. SH3-VP16 
5 present at nigner amounts than their bundled counterparts. SH3-LVP16 and SH3-Lp65 
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(Fia 6C) suggesting that the lack of reporter gene activation is not linked to the overall 
intracellular 1 vels of the activation domain fusion proteins. However, in a separate western 
Hot probed with GAL4 antibody, we were unable to detect the ^^^^ 
suggesting that this fusion protein is toxic to cells. Thus, we conclude that when the DMA 
binding component (GCBL) is present in very low amounts in the cells, only the bundled 
activation domain fusion proteins are capable of delivering a sufficient number of act-vafon 
domains to the promoter for transcriptional activation of the reporter gene to occur. Taken 
together, these data strongly suggest that bundling activation doma.n f us,on prote,ns, ,n 
mammalian two-hybrid assays, may greatly enhance the detection of interactions between 
two proteins when one or both of them is present at ve,y low levels in the cell. 



Equivalents ^ 

Those skilled in the art will recognize, or be able to ascertatn us.ng no more than 
routine experimentation, numerous equivalents to the specific materials and methods 
described herein. Such equivalents are considered to be within the scope of the 
invention.. 
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Claims: 

1 . A recombinant nucleic acid encoding a fusion protein containing a bundling domain and 
at least one additional domain that is heterologous thereto. 

2. The recombinant nucleic acid of daim 1 wherein the bundling domain is a dimerization 
domain, trimerization domain or tetramerization domain. 

3 The recombinant nucleic acid of daim 2 wherein the bundling domain is or is derived 
from a lac repressor tetramerization domain, a p53 tetramerization domain or a leucne 
zipper domain. 

A. The recombinant nucleic add of any of claims 1-3 wherein the heterologous domain is 
a transcription activation domain. 

5. The recombinant nucie.c acid of any of daims 1 -3 wherein the heterologous domain is 
a transcription repression domain. 

6. The recombinant nudeic add of any of daims 1-3 wherein the heterologous domain is 
a DNA binding domain. 

7. The recombinant nucleic add of any of claims 1 -3 wherein the heterologous domain is 
a ligand binding domain. 

8. The recombinant nudeic add of claim 4 wherein the heterologous domain is or is 
derived from a p65, VP16 or AP domain. 

9 The recombinant nudeic acid of claim 5 wherein the heterologous domain is or is 
derived from a KRAB domain or a ssn-6/TUP-1 or Kruppel family suppressor doma.n. 

1 0. The recombinant nudeic acid of claim 6 wherein the heterologous domain is or is 
derived from a GAL4, lex A or composite DNA-binding domain. 

1 1 The recombinant nucleic add of daim 7 wherein the heterologous domain is or is 
derived from an immunophilin, cydcphiiin. FRB, antibiotic resistance or hormone receptor 
domain. 

12. The recombinant nudeic acid of daim 1 1 wherein the heterologous domain is or is 
derived from FKBP, tetR. progesterone receptor or ecdysone receptor. 
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13 The recombinant nudeic add of any of claims 1. 2. 3, 4. 6. 8 or 10 wherein the fusion 
protein comprises a bundling domain, at least one transcription activation doma.n and at 
least one DNA binding domain. 

,4 The recombinant nucleic acid of any of claims 1 . 2. 3. 4. 7. 8. 1 1 or 12 wherein 
tusion protein comprises a bundling domain, at .east one transcnpt.on act.vat.on doma.n 
and at least one ligand binding domain. 

n5 . The recombinant nucleic acid of any of claims 1 , 2. 3. 5. 9. 7 1 1 or 12 wherein .the 
.usion protein comprises a bundling domain, at .east one transcnpt.cn repress.on doma.n 
and at least one ligand binding domain. 

16 The recombinant nudeic acid of any of Cairns 1. 2. 3. 6. 7. 10 11 or 12 wherein the 
ion protein comprises a bundling domain, at .east one DNA bind.ng doma.n and at .east 
one ligand binding domain. 

17 The recombinant nude.c add of any of daims 1 . 2. 3. 4. 6. 7. 8. 10. 1 1 or 12 encoding 
a fusion protein containing a bundling domain, a ligand binding doma.n. a transcnpfon 
activation domain and a DNA binding domain. 

1 8 The recombinant nucleic acid of c.aim 1 4 wherein the fusion protein contains a lac 
repressor tetramerization domain, at least one FRB domain and at least one P 65 
transcription activation domain. 

n . The recombinant nudeic add of any of daims 8. 13. 14. 17 or 18 wherein the fudon 
protein comprises at least one domain derived from a P 65 transcnpton ad.vat.on doma.n 
which contains one or more of the mutations of figure 7. 

20. A recombinant nudeic add encoding a fusion protein containing at 'east one domain 
derived from a P 65 transcription activation domain and at .east one doma.n which .s 
heterologous thereto, in which the p65-derived domain contains one or more o, the 
mutations of figure 7. 

21 • The recombinant nudeic acid of Cairn 20 wherein the heterologous domain is a .igand- 
binding domain. 

22. The recombinant nucieic add of claim 21 wherein the ligand-binding domain is or is 
derived from an FKBP. cyclophilin or FRB doma.n. 
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23. The recombinant nucleic acid of ciaim 21 wherein the ligand-binding domain is or is 
derived from a tetR domain. 

24. The recombinant nucleic acid of claim 21 wherein the ligand-binding domain is or is 
derived from a hormone receptor domain. 

25. The recombinant nucleic acid of claim 24 wherein the hormone receptor domain is a 
steroid receptor domain. 

26. The recombinant nucteic acid of claim 20 wherein the heterologous domain is a DNA 
binding domain. 

27 The recombinant nucleic acid of claim 26 wherein the DNA binding domain domain is or 
is derived from a GAL4, lex A or composite DNA-binding domain. 



28. A fusion protein 



encoded by the recombinant nucleic acid of any of claims 1-27. 



29 A nucleic acid composition comprising . . 

(a) a first nucleic acid encoding elusion protein containing a bundling domain, a 
ligand binding domain and a transcription activation domain 

(b) a second nucleic acid encoding a fusion protein containing a ligand b.nd.ng 
domain and a DNA binding domain. 

30. A nucleic acid composition comprising . 

(a) a first nucleic acid encoding a fusion protein containing a bundling domain, a 
ligand binding domain and a DNA binding domain 

(b) a second nucleic acid encoding a fusion protein containing a ligand b.nd.ng 
domain and a transcription activation domain. 

31. A nucleic acid composition comprising Hnmain a 

(a) a first nucleic acid encod.ng a fusion protein conta.n.ng a bundt.ng doma.n, a 
ligand binding domain and a transcription activation domain 

(b) a second nucleic acid encoding a DNA binding domain. 

32. The nucleic acid composition of any of claims 29 - 31 which further comprises a target 
gene operatively linked to an expression control sequence. 

33. A nucleic acid composition comprising 
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(a) a first nucleic acid encoding a fusion protein containing a bundling domain, a 
ligand binding domain, a transcription activation domain and a DNA binding 
domain 

(b) a second nucleic acid comprising a target gene operatively linked to an 
expression control sequence. 

34 A nucleic acid composition comprising 

(a) a first nucleic acid encoding a fusion protein containing a bundling doma.n.a 

transcription activation domain and a DNA binding domain 

(b ) a second nucleic acid comprising a target gene operatively linked to an 
expression control sequence. 

35 The nucleic acid composition of claim 32 or 33 which further comprises a nucleic acid 
encoding a fusion protein containing a bundling domain and a transcription activation 
domain. 

36 The nucleic acid composition of claim 34 which further comprises a nucleic acid 
encoding a fusion protein containing a bundling domain and a transcription activation 
domain. 

37 The nucleic acid composition of claim 32 which further comprises a nucleic acid 
encoding a fusion protein containing a ligand binding domain, a bundling domain and a 
transcription activation domain. 

38. A vector comprising a nucleic acid of any of claims 1 -27. 

39. A vector comprising a nucleic acid composition of any of claims 29-37. 

40. The vector of claim 38 or 39 wherein the vector is a viral vector. 

41 The vector of claim 40 wherein the vector is selected from the group consisting of 
adenoviral vectors, AAV vectors, retroviral vectors, hybrid adenovirus-AAV vectors. HSV 
vectors. 

42. The vector of claim 40 or 41 which is further packaged into recombinant virus. 

43. A composition comprising 

(a) a first recombinant virus comprising the nucleic acid composition of claim 29, 

30 or 31 
and 
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(b) a second recombinant virus comprising a target gene construct comprising a 
target gene operatively linked to an expression control sequence. 

44. A composition comprising 

(a) a first recombinant virus comprising the recombinant nucleic aad of claim 17 

and . . 

(b) a second recombinant virus comprising a target gene construct compnsing a 
target gene operatively linked to an expression control sequence. 

45. A composition comprising 

(a) a first recombinant virus comprising the recombinant nucleic acid of cla.m 13 

and . . 

(b) a second recombinant virus comprising a target gene construct compnsing a 
target gene operatively linked to an expression control sequence. 

46 The composition of claim 43 or 44 wherein the second virus additionally comprises a 
nucleic acid encoding a fusion protein comprising a bundling domain and a transcnpfon 
activation domain. 

47 The composition of claim 45 wherein the second virus additionally comprises a nucleic 
acid encoding a fusion protein compnsing a bundling domain and a transcript.on activation 
domain. 

48 The composition of claim 43 wherein the second virus additionally comprises a nucleic 
acid encoding a fusion protein comprising a ligand binding domain, a bundling domain and a 
transcription activation domain. 

49 The composition of claim 43 or 44 which further comprises a third recombinant virus 
containing a nudeic acid encoding a fusion protein comprising a bundling domain and a 
transcription activation domain. 

50 The composition of claim 45 which further comprises a third recombinant virus 
containing a nucleic acid encoding a fusion protein comprising a bundling domain and a 
transcription activation domain. 

51 The composition of claim 43 which further comprises a third recombinant virus 
containing a nucleic acid encoding a fusion protein comprising a ligand binding domain, a 
bundling domain and a transcription activation domain. 
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52 The composition of any of claims 43-51 wherein the recombinant virus is selected 
from the group consisting of adenovirus, AAV. retrovirus, hybrid adenovirus-AAV, HSV. 

53. A method for rendering cells capable of ligand-dependent transcription of a target gene 
by introducing into the cell any of the nucleic acid compositions of claims 29-33, 35 or 37 
under conditions permitting uptake by the cell of nucleic acids. 

54. A method for rendering cells capable of ligand-dependent transcription of a target gene 
by introducing into the cell any of the compositions of claims 43. 44. 46. 47. 49 or 51 . 

55. The method of claim 53 or 54 wherein the compositions are introduced ex vivo. 

56. The method of claim 53 or 54 wherein the compositions are introduced in vivo. 

57. A host cell containing a nucleic acid of any of daims 1 -27. 

58. A host cell containing a nucleic acid composition of any of claims 29-33. 35 or 37 . 

59. A host cell containing a nucleic acid composition of claim 34 or 36. 

60. A host cell containing a composition of any of claims 43. 44. 46. 47. 49 or 51 . 

61 . A host cell containing a composition of any of claims 45. 47 or 50. 

62. A host cell prepared by the method of any of daims 53-56. 

63 A method for regulating expression of a target gene by adding a cell permeant ligand 
to the host cell of any of daims 58. 60 or 62. wherein the cell permeant ligand binds to the 
ligand binding domains of the fusion proteins and activates gene expression. 

64. The method of daim 63 wherein the host cell is in a whole organism. 

65. The method of claim 64 wherein the organism is a mammal. 

66. The method of daim 65 wherein the cells are of primate origin and the mammal is a 
primate. 

67. The method of claim 66 wherein the primate is a human. 
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68 A DNA vector containing a recombinant DNA sequence comprising a first portion 
encoding a fusion protein containing a bundling domain and an additional doma.n that is 
heterologous thereto and a second portion comprising a cloning site for the insert.cn of a 
DNA sequence of interest. 

69. A cell containing recombinant nucleic acids encoding 

(a) a first fusion protein comprising a bundling domain, a transcription activation 
domain and one member of a peptide binding pair, 

(b) a second fusion protein comprising a DNA-binding domain and the other 
member of the peptide binding pair. 

wherein the peptide binding pair comprises (i) a peptide ligand and (ii) a peptide 
binding domain capable of binding to the peptide ligand, and 

wherein the cell further contains a reporter gene which is linked to an 
control sequence which permits reporter gene expression upon associat.cn of the two 
fusion proteins. 

70. A genetically engineered host cell which comprises 

(a) a reporter gene linked to a regulatable expression control element. 

(b) a first recombinant nucleic acid encoding a fusion protein compns.ng a DNA 
binding domain linked to a protein domain of interest and 

(c) a second recombinant nucleic acid comprising a cloning site linked to a 
nucleic acid sequence encoding a fusion protein containing a bundling 
domain and a transcription activation domain 

wherein association of the fusion proteins activates expression of the reporter gene. 

71 A genetically engineered host cell which comprises 

(a) a reporter gene linked to a regulatable expression control element. 

(b) a first recombinant nucleic acid encoding a fusion protein compns.ng a DNA 
binding domain linked to a protein domain of interest and 

(c) a second recombinant nucleic acid comprising a member of a test hbrary 
linked to a nucleic acid sequence encoding a fusion protein contammg a 
bundling domain and a transcription activation domain 

wherein association of the fusion proteins activates expression of the reporter gene. 

72. A genetically engineered host cell which comprises 

(a) a reporter gene linked to a regulatable expression control element, 
th) a first recombinant nucleic acid encoding a fusion protein compnsmg a 
transcription activation domain linked to a protein domain of interest and 
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(c) a second recombinant nucleic acid comprising a cloning site linked to a 
nucleic acid sequence encoding a fusion protein containing a bundling 
domain and a DNA binding domain 
wherein association of the fusion proteins activates expression of the reporter gene. 

73 A genetically engineered host cell which comprises 

(a) a reporter gene linked to a regulatable expression control element. 

(b) a first recombinant nucleic acid encoding a fusion protein compris.ng a 
transcription activation domain linked to a protein domain of interest and 

(c) a second recombinant nucleic acid comprising a member of a test library 
linked to a nucleic acid sequence encoding a fusion protein containing a 
bundling domain and a DNA binding domain 

wherein association of the fusion proteins activates expression of the reporter gene. 

74. A method for identifying a mo.ety capable of binding to a protein or protein domain of 
interest which comprises the steps: 

(a) contacting genetically engineered cells of claims 69-73 with members of a 
combinatorial library under suitable conditions permitting gene expression, 

(b) observing the presence and/or amount of expression of the reporter gene, 
and 

(c) correlating the presence and/or amount of reporter gene expression with 
contact of cells with one or more individual members of the combinatorial 
library. 
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